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Preface 


The current volume brings together several contributions from the 
invited speakers and guests of the workshop ‘Groups, Graphs and 
Random Walks’ held in Cortona (Italy) on June 2 to 6, 2014, on the 
occasion of the sixtieth anniversary of Wolfgang Woess. 

Wolfgang was born in Vienna on July 23, 1954, to Friedrich and 
Elisabeth Woess, both professors at the University of Vienna. His father 
was also a gifted painter: when visiting Wolfgang, one immediately gets 
attracted to Friedrich Woess’s beautiful watercolor landscapes adorning 
the walls of his office at the university as well as of his cosy home. 

Wolfgang studied mathematics at the Technical University of Vienna, 
where he obtained his diploma, at the University of Munich, and at 
the University of Salzburg, where he obtained his PhD under the 
supervision of Peter Gerl. After a period as an assistant professor 
at the Montanuniversitat Leoben (1984-1989)—including a leave of 
absence at the University of Rome ‘La Sapienza’ (1984-1985), where 
he started a long and fruitful collaboration with the Italian Harmonic 
Analysis group led by Alessandro Figa-Talamanca—and eleven years 
as a professor at the University of Milan (1988-1999), he eventually 
became Professor at the Graz University of Technology in 1999, where 
he currently serves as the chair of the Institute of Discrete Mathematics. 

Wolfgang has been and still is, for many of us, a great teacher, a 
colleague, and a dear friend. As a teacher, he had thirteen PhD students 
(essentially from the University of Milan and the Graz University 
of Technology) and several postdoctoral fellows who have obtained 
important recognition both at the scientific and the academic levels. 

His publications (nearly a hundred) range among various mathemat- 
ical subjects, including convolution powers of probability measures on 
groups and asymptotics of random walk transition probabilities (at the 
very beginning of Wolfgang’s research); recurrence, spectral radius and 
amenability, and spectral computations; boundary theory and harmonic 
functions; infinite electrical networks; context-free languages and their 
relations with groups and random walks; infinite graphs and groups; 
random walks on affine groups, buildings, horocyclic products, and 
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lamplighter groups; finally and more recently, reflected random walks and 
stochastic dynamical systems; Brownian motion on strip (‘quantum’) 
complexes, treebolic spaces and SOL Geometry, and Markov processes 
on ultra-metric spaces. The long list of collaborators (more than thirty) 
includes, in order of multiplicity: Massimo Picardello, Laurent Saloff- 
Coste, Donald Cartwright, Vadim Kaimanovich and his former student 
Sara Brofferio. 

One should also mention his beautiful and masterly written mono- 
graphs Random Walks on Infinite Graphs and Groups (Cambridge 
University Press, 2000) and Denumerable Markov Chains— Generating 
Functions, Boundary Theory, Random Walks on Trees (European Math- 
ematical Society Publishing House, 2009). 

As mentioned, in the present volume we collect some papers con- 
tributed by participants to the Cortona conference: the themes are all 
intimately related to Wolfgang’s research interests and scientific produc- 
tion. Here we overview, with a brief description, these contributions. 


Growth of Groups and Wreath Products 

Laurent Bartholdi (Georg-August University of Gottingen) 

The central theme of this survey chapter is the Bartholdi—Erschler 
construction, via wreath products, of many groups of diverse types of 
growth: either intermediate, with many different growth functions, or of 
non-uniform exponential growth. On the way, Bartholdi discusses current 
hot topics of geometric group theory such as self-similar groups, branch 
groups, finite-automata groups, rooted trees, complete growth series and 
the like. 


Random Walks on Some Countable Groups 

Alexander Bendikov (Wroclaw University) and Laurent Saloff-Coste 
(Cornell University) 

The chapter by Bendikov and Saloff-Coste studies decay of convolution 
powers of probability measures on non-finitely generated countable 
groups. Their methods are primarily based on explicit calculations of 
convolution powers of convex combinations of Haar measures and com- 
parison techniques. It contains, in particular, an interesting collection 


of precise estimates for random walks on the infinite symmetric group 
Sie, 


The Cost of Distinguishing Graphs 

Debra Boutin (Hamilton College) and Wilfried Imrich (Leoben 
University) 

Boutin and Imrich study the notion of distinguishing cost of a graph, 
recently introduced by the first author, which is defined as the smallest 
size of a vertex set whose set-wise stabilizer in the automorphism group 
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is trivial, and therefore constitutes a measure of the symmetry of the 
given graph. Clearly, it exists if and only if the distinguishing number 
(minimal number of colors needed for a coloring, which is not preserved 
by any non-trivial automorphism) is at most two. Furthermore, it is 
always bounded from below by the minimal size of a base (set whose 
point-wise stabilizer is trivial). Thus, the distinguishing cost could serve 
as a finer measure of the degree of symmetry for graphs with equal 
distinguishing number. 


A Construction of the Measurable Poisson Boundary 

Sara Brofferio (Paris-Sud University) 

The chapter by Brofferio addresses an important problem about Poisson 
boundaries of random walks. Recall that, given a measure mu on 
a locally compact group G, the Poisson boundary is a measurable 
G-space (X,v) with w * v = v such that the Poisson transform @ bh 
fo(g) := Sx b(gx)dv(«) defines an isometry of L°(X,o * v) onto the 
space H>°(G) of bounded A-a.e. 4-harmonic functions on G (here p is 
a probability measure on G equivalent to the Haar measure i, and a 
function f: G > R is termed A-a.e. jz-harmonic if f(g) = Sof gh) du(h) 
for A-a.e. g € G). When yu is supported on a dense countable subgroup [ 
of G, there are two notions of Poisson boundary: one (as above) on G and 
one on I’ endowed with the discrete topology and the counting measure. 
In this chapter a kind of inductive construction is proposed to obtain the 
G-Poisson boundary from the I'-Poisson boundary. Consider the action 
of F on Gx X defined by y *(g, x) := (gy, yx). Then Brofferio proves 
that the quotient space associated with the T-invariant sets is a kind of 
G-Poisson boundary. This is applied to describe the Aff(p,R) Poisson 
boundary of the Baumslag—Solitar group BS(1, p), where Aff(p, R) is the 
closure of the usual representation of BS(1,p) in the group Aff(R) of 
affine transformation of the real line. 


Structure Trees, Networks and Almost Invariant Sets 

Martin J. Dunwoody (University of Southampton) 

Stallings’ celebrated theorem (1968) about ends of groups states that a 
finitely generated group G has more than one end if and only if it admits 
a non-trivial decomposition as an amalgamated free product or an HNN 
extension over a finite subgroup. In the modern language of Bass—Serre 
theory, the theorem says that a finitely generated group G has more 
than one end if and only if it admits a non-trivial (that is, without a 
global fixed point) action on a simplicial tree with finite edge-stabilizers 
and without edge-inversions. This fundamental result, together with 
a question formulated by Wall, was a starting point for Dunwoody’s 
accessibility theory: a finitely generated group G is said to be accessible 
if the process of iterated nontrivial splittings of G over finite subgroups 
always terminates in a finite number of steps. Dunwoody (1985) proved 
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that every finitely presented group is accessible; he later showed that 
there do exist finitely generated groups that are not accessible. The 
notion of accessibility was later extended to the graph setting by 
Thomassen and Woess (1993): a graph is accessible if there is an integer 
n such that any two ends can be separated by removing at most n 
edges; in the same chapter, the authors obtain a number of results using 
structure trees. Dunwoody gives a self-contained account of the theory, 
initiated with the aforementioned works of Stallings and later developed 
by Dicks and Dunwoody, of structure trees for edge cuts in networks. 
Applications include a generalization of the Max-Flow Min-Cut theorem 
to infinite networks, a short proof of a conjecture of Kropholler, a relative 
version of Stallings theorem and a generalization of the Almost Stability 
theorem by Dicks and Dunwoody. 


Amenability of Trees 

Behrang Forghani (University of Connecticut) and Keivan Mallahi-Karai 
(Jacobs University of Bremen) 

The chapter by Forghani and Mallahi-Karai gives a necessary and 
sufficient condition for a tree to be amenable. As an application of this 
result, it is proven that a Galton—Watson tree is, under some specific 
conditions, almost surely amenable. 


Group Walk Random Groups 

Agelos Georgakopoulos (University of Warwick) 

The chapter by Georgakopoulos discusses a new class of random graphs 
that combines ideas from random graph theory as well as random walks. 
Take an infinite graph, and let G,, denote the intrinsic ball of radius n 
around a root vertex. Then construct the random graph on the boundary 
dGn = Gn \ Gn_-1 by letting random walks start in all the vertices in 
dGp and connecting 7,y € dG, when the random walk starting in x 
leaves Gp in y, or vice versa. This is an entirely novel construction 
of a random graph, somehow interpolating between normal random 
graphs and random walk interlacements. Georgakopoulos studies various 
properties of the graph, such as the number of connections between 
macroscopic parts of 0G; as n > ov, relations to the Poisson boundary 
of graphs and effective conductances, and the Doob—Naim’s kernel. 


Ends of Branching Random Walks on Planar Hyperbolic 
Cayley Graphs 

Lorenz A. Gilch (Graz University of Technology) and Sebastian Miiller 
(Aiz-Marseille University) 

Properties of branching random walks (BRWs) on a graph G are of 
interest when the trace of the process (the random subgraph formed 
of vertices visited by particles of BRW) is a proper subgraph of G. The 
authors study the trace of transient BRWs when G is a planar hyperbolic 
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Cayley graph: in this case it is shown that the trace of BRW has, almost 
surely, infinitely many ends. 


Amenability and Ergodic Properties of Topological Groups: 
From Bogolyubov Onwards 

Rostislav Grigorchuk (Texas A&M University) and Pierre de la Harpe 
(University of Geneva) 

The theory of amenable groups emerged from the study of the axiomatic 
properties of the Lebesgue integral and the discovery of the Hausdorff-— 
Banach—Tarski paradox at the beginning of the last century. The first 
definition of an amenable group, by the existence of an invariant finitely 
additive probability measure, is due to von Neumann in his 1929 seminal 
paper. The term amenable was introduced in the 1950s by M.M. Day, 
who played a central role in the development of the modern theory of 
amenability by using means and applying techniques from functional 
analysis. In these references, groups do not have topology; in other words, 
they are just ‘discrete groups’. Amenability is considered explicitly for 
topological groups in later articles by Dixmier, Fomin and Rickart and 
(for locally compact groups) by Greenleaf in his influential book. The 
chapter by Grigorchuk and de la Harpe is a survey on amenability and 
ergodicity for topological groups (as opposed to locally compact groups), 
as in Bogolyubov’s 1939 paper, emphasizing the characterizations true 
in general and those true only in the locally compact case. This is of 
particular importance in view of the recent renewed interest in ‘large’ 
groups, for example, topological full groups of Cantor minimal dynamical 
systems (these groups are sources of infinite simple non-elementary 
amenable groups). 


Schreier Graphs of Grigorchuk’s Group and a Subshift Associ- 
ated to a Non-Primitive Substitution 

Rostislav Grigorchuk (Texas A&M University), Daniel Lenz (Friedrich 
Schiller University Jena) and Tatiana Nagnibeda (University of Geneva) 
The authors describe a remarkable connection between a class of Laplace- 
type operators on Schreier graphs associated with the Grigorchuk group 
of intermediate growth and a class of Schrodinger operators with poten- 
tials defined by sampling over a strictly ergodic aperiodic subshift defined 
by a non-primitive substitution. This beautifully elucidated connection 
provides an example of a reduction of a hard (read: not strictly one- 
dimensional) problem to an easy (read: one-dimensional) problem. From 
this point of view, it is reminiscent of work on the XY spin chain, which 
reduces the hard (many-body) problem to an easy (one-particle) effective 
Hamiltonian via the Jordan—Wigner transformation. This chapter also 
includes a substantial and clear discussion of the spectral characteristics 
of aperiodic one-dimensional Schréodinger operators, the definition and 
properties of the Grigorchuk group, the structure of the substitution t 
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and its subshift, and the relationship between this subshift and the 
graphs associated with the action of Grigorchuk’s group on the infinite 
binary tree. 


Thompson’s Group F Is Not Liouville 

Vadim Kaimanovich (University of Ottawa) 

Thompson’s group F’, introduced by Richard Thompson in 1965, is the 
(finitely presented, infinite) group consisting of all orientation preserving 
piecewise-linear dyadic self-homeomorphisms of the closed unit interval 
[0, 1]. One of the most important open questions in geometric group the- 
ory is whether or not Thompson’s group F’ is amenable. Note that Brin 
and Squier proved that F does not contain non-abelian free subgroups: 
as a consequence, if F is not amenable, then it would constitute another 
counterexample to the so-called ‘von Neumann conjecture’, which stated 
that a finitely generated group is amenable if and only if it does not 
contain non-abelian free subgroups. The first finitely generated (resp. 
finitely presented) counterexamples to this conjecture were found by 
Olshanskii in 1980 using his Tarski monsters (resp. by Olshanskii and 
Sapir in 2002). 

The classical Liouville theorem asserts that the only bounded har- 
monic functions on Euclidean space are the constants. Now, the notion 
of a harmonic function (based on the mean value property) can in 
fact be defined for an arbitrary Markov chain and, in particular, for 
random walks on groups. Given a probability measure w on a group 
G, denote by H™(G,u) = {f € €~(G) : f = f * uw} the space 
of all bounded p-harmonic functions on G. One then says that the 
random walk (G, 2) is Liouville if H°(G, jz) consists only of the constant 
functions. Furstenberg (1973) proved that any group carrying a non- 
degenerate Liouville random walk is amenable. One calls a group G 
Liouville if (G, w) is Liouville for any symmetric and finitely supported 
probability measure « on G. In his chapter, Kaimanovich shows that 
Richard Thompson’s group F is not Liouville. More precisely, he proves 
that the random walk on F driven by any strictly non-degenerate finitely 
supported probability measure yu has a non-trivial Poisson boundary. 


An Alternative Proof of the Subadditive Ergodic Theorem 
Anders Karlsson (University of Geneva and Uppsala universitet) 
Karlsson’s chapter contains a self-contained, clean and short proof of 
Kingman’s celebrated subadditive ergodic theorem. The scheme of the 
proof follows the classical proof of Birkhoff’s ergodic theorem, via the 
maximal ergodic lemma. There are, however, two important differences: 
first, the maximal ergodic lemma is replaced by a result by Derriennic 
(1975), which yields non-positivity of a limit integral lim [, *%" du(z) 
(as opposed to the usual te a(1,x)dj(x), which is the same in the case 
of additive cocycles), and, second, there is a clever argument at the end 
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of the proof to derive everywhere convergence in the subadditive case, 
from the same statement for additive cocycles. 


Boundaries of Z”-Free Groups 

Andrei Malyutin (St. Petersburg Department of V.A. Steklov Mathe- 
matical Institute), Tatiana Nagnibeda (University of Geneva) and Denis 
Serbin (Stevens Institute of Technology) 

Given an arbitrary ordered Abelian group A (for instance, Z or R), 
a A-tree is a metric space whose metric takes values in A and is subject 
to certain tree axioms. This notion goes back to the early 1960s, when 
Lyndon introduced the notion of abstract length functions on groups, 
and, a few years later, to Chiswell, who related such length functions 
with group actions on Z- and R-trees — providing a construction of 
the tree on which the group acts — and to Tits, who explicitly gave 
the first formal definition of an R-tree. This theory has significantly 
developed since then, the main problem being addressed is to find the 
group theoretic information carried by a A-tree action, in particular, 
the structure of A-free groups. Malyutin, Nagnibeda, and Serbin study 
the Poisson boundary of groups acting on Z"-trees (here Z” is equipped 
with the right lexicographic order). The groups considered in this chapter 
constitute a natural generalization of free groups; the class of Z"-free 
groups includes, in particular, all limit groups, and is closed under taking 
subgroups, free products and amalgamated free products along maximal 
cyclic subgroups. The authors provide a construction of the Poisson 
boundary for these groups directly in terms of the action of the group 
on the Z”-tree. 


Buildings, Groups of Lie Type, and Random Walks 

James Parkinson (University of Sydney) 

Parkinson masterly surveys the fascinating theory of random walks on 
buildings and associated groups of Lie type and Kac-Moody groups. The 
author does a beautiful job of explaining ideas that are potentially quite 
technical and provides a comprehensive update of recent results dealing 
with probability theory on groups of Lie type defined over other p-adic 
fields, and extensions of these results into the setting of Kac-Moody 
groups. The unifying feature is the combinatorial-geometrical notion of 
a building, introduced by Tits in the 1950s in his successful attempt 
to give a uniform geometric interpretation of semi-simple Lie groups. 
Here the author focuses on the classes of buildings on which random 
walks have been studied, including spherical buildings, affine buildings 
(playing an important role in the study of Lie groups over p-adic fields), 
and Fuchsian and twin buildings (extensively used in the theory of 
Kac—Moody groups) and shows how the theory of random walks on build- 
ings leads to limit theorems for random walks on the associated groups. 
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On Some Random Walks Driven by Spread-out Measures 
Laurent Saloff-Coste (Cornell University) and Tianyi Zheng (Stanford 
University) 

The chapter by Saloff-Coste and Zheng concerns heat kernel estimates for 
random walks on finitely generated groups and focuses on the case when 
the groups have polynomial volume growth and the initial distribution 
of the random walk has a suitable (slow) decay. The techniques used 
allow for the treatment of a variety of examples. The authors also give 
an application of the method to wreath products. 


Topics in Mathematical Cristallography 

Toshikazu Sunada (Meiji University) 

The volume ends with the survey by Sunada. It is based on his book 
Topological Crystallography and, in connection with tight frames, sheds 
light, with a new geometric insight, on the relationship of his study 
on the standard realization of crystal lattices with several different 
topics such as tight frames in Euclidean space, crystallography, algebraic 
geometry (rational points in Grassmannians, the Abel-Jacobi map), 
number theory (quadratic Diophantine equations) and combinatorics 
(spherical designs). 
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Introduction 


These notes are an expanded version of a mini-course given at Le 
Louverain, June 24—27, 2014. Its main objective was to gather together 
useful facts about wreath products, and especially their geometry, in 
its application to problems and questions about growth of groups. The 
wreath product is a fundamental construction in group theory, and I hope 
to help make the reader more familiar with it. 

It has proven very useful, in recent years, in better understanding 
asymptotics of the word growth function on groups, namely the function 
assigning to R € N the number of group elements that may be obtained 
by multiplying at most R generators. The chapters [4-8] contain many 
repetitions as well as references to outer literature; by providing a unified 
treatment of these articles, I may provide the reader with easier access 
to the results and methods. 

I have also attempted to define all notions in their most natural 
generality, while restricting the statements to the most important or 
fundamental cases. In this manner, I would like the underlying ideas to 


Partially supported by ANR grant ANR-14-ACHN-0018-01 and DFG grant 
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appear more clearly, with fewer details that obscure the line of sight. 
I avoided as much reference to literature as possible, taking the occasion 
to reprove some important results along the way. 

I have also allowed myself, exceptionally, to cheat. I do so only under 
three conditions: (1) I clearly mark where there is a cheat; (2) the 
complete result appears elsewhere for the curious reader; (3) the correct 
version would be long and uninformative. 

I have attempted to make the text suitable for a short course. In 
doing so, I have included a few exercises, some of which are hopefully 
stimulating, and a section on open problems. What follows is a brief tour 
of the highlights of the text. 


Wreath Products 

The wreath product construction, described in 81, is an essential 
operation, building a new group W out of a group H and a group G 
acting on a set X. Assuming! that G is a group of permutations of X, the 
wreath product is the group W = Hxy G of H-decorated permutations 
in G: if elements of G are written in the arrow notation, with elements 
of X lined in two identical rows above each other and an arrow from 
each x € X to its image, then an element of Hwy G is an arrow diagram 
with an element of H attached to each arrow, e.g. 


SS Pa ~ 
hy. hg 
Pat St 


One of the early uses of wreath products is as a classifier for 
extensions, as discovered by Kaloujnine, see Theorem 1.4: there is 
a bijective correspondence between group extensions with kernel H 
and quotient G on the one hand, and appropriate subgroups of the 
wreath product HeG, with G seen as a permutation group acting on 
itself by multiplication. We extend this result to permutational wreath 
products: 


Theorem (Theorem 1.6). Let G,H be groups, and let G act on the 
right on a set X. Denote bya: HxG => G the natural projection. Then 
the map Et E defines a bijection between 


E:E <4 and the G-sets X and H\E are isomorphic 
via a homomorphism E > G 


isomorphism E — E" of groups intertwining the actions on H\E and H\E' 


1 There is no need to require the action of G on X to be faithful; this is merely a 
visual aid. See §1 for the complete definition. 
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and 
E < HeG:xn(£) transitive on X and 
ker(1) NE le via fr f(x) for allae X | 
conjugacy of subgroups of HaG 
The wreath product Hwy G is uncountable, if H ~# 1 and X is 
infinite. It contains some important subgroups: Hx G, defined as those 


decorated permutations in which all but finitely many labels are trivial; 
and H ee G, defined as those decorated permutations in which the labels 


take finitely many different values. Clearly H2x GC H ae GC AxexG, 
and H vx G is countable as soon as H, G, X are countable. 


Growth of Groups 

Let us summarise here the main notions; for more details, see §2. A choice 
of generating set S for a group G gives rise to a graph, the Cayley graph: 
its vertex set is G, and there is an edge from g to gs for each g € G, 
s € §. The path metric on this graph defines a metric d on G called the 
word metric. The Cayley graph is invariant under left translation, and 
so is the word metric. 

One of the most naive invariants of this graph is its growth, namely 
the function vg,g(2) measuring the cardinality of a ball of radius R in 
the Cayley graph. If the graph exhibits some kind of regularity, then it 
should translate into some regularity of the function vg,g. 

For example, Klarner [47, 48] studied the growth of crystals (that 
expand according to a precise and simple rule) via what turns out to be 
the growth of an abelian group. 

A convenient tool to study various forms of regularity of a function 
vg,s is the associated generating function [g_5(z) = Vo ren(Va,5(R) — 
Va,s(R-— 1))z”. The regularity of vg,g translates then into a property 
of 'g,g such as being a rational, algebraic, D-finite, ... function of z. 

We may rewrite [g.g(z) = vee z4(1,9). then a richer power series 
keeps track of more regularity of G: 


Pos) = Ss 9289), 
géG 


This is a power series with coefficients in the group ring ZG, and again 


we may ask whether To, g is rational or algebraic?. 


2 The rational subring of ZG|[t]] is the smallest subring of ZG|[t]] containing ZG[E] 
and closed under Kleene’s star operation A* = 1+ A+ A?+-.--, for all A(z) with 
A(0) = 0. 

The algebraic subring of ZG|[t]| is the set of power series that may be expressed 
as the solution A, of a non-trivial system of non-commutative polynomial equations 
{P1(Aq,...,; An) = 0,:--,Pn(A1,...,An) = 0} with coefficients in ZG[t]. The 
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If G has an abelian subgroup of finite index [52], or if G is word- 
hyperbolic [33], then 'g,g is a rational function of z for all choices of S. 
We give a sufficient condition for 'g,s to be algebraic: 


Theorem (Theorem 3.2). Let H = (T) be a group such that Tar is 
algebraic, and let F be a free group. Consider G = H2 F’, generated by 
S= TU {a basis of F}. Then Tq,g is algebraic. 


We then turn to studying the asymptotics of the growth function vgs. 
Let us write v S w to mean that v(R) < w(CR) for some constant 
C e€ Ry and all R > 0, and v ~ w to mean v X w XZ v. Then the 
~-equivalence class of vg,g is independent of the choice of S', so we may 
simply talk about vq. 

For ‘most’ examples of groups, either vg(R) is bounded by a poly- 
nomial in R or vg(R) is exponential in R. This is, in particular, 
the case for soluble, linear and word-hyperbolic groups. There exist, 
however, examples of groups for which vg(R) admits an intermediate 
behaviour between polynomial and exponential; they are called groups 
of intermediate growth. The question of their existence was raised 
by Milnor [58], was answered positively by Grigorchuk [29], and has 
motivated much group theory in the second half of the twentieth century. 

Let ny * 2.46 be the positive root of T? — T? — 2T — 4, and set 
a = log2/log n+ * 0.76. We shall show that, for every sufficiently regular 
function f: R, > R}+ with exp(R®) S f XS exp(R), there exists a group 
with growth function equivalent to f: 


Theorem (Theorem 6.2). Let f: Ry > Ry be a function satisfying 
fRR) < f(R)? < f(n+R) for all R large enough. 
Then there exists a group G such that vg ~ f. 


Thus, groups of intermediate growth abound, and the space of 
asymptotic growth functions of groups is as rich as the space of functions. 
Furthermore, we shall show that there is essentially no restriction on the 
subgroup structure of groups of intermediate growth. Let us call a group 
HT locally of subexponential growth if every finitely generated subgroup 
of H has growth function % exp(R). Clearly, if H is a subgroup of a 
group of intermediate growth then it has locally subexponential growth. 
We show, conversely: 


Theorem (Theorem 7.1). Let B be a countable group locally of 
subexponential growth. Then there exists a finitely generated group of 
subezponential growth in which B imbeds as a subgroup. 


solution is actually rational if furthermore the P; are of the form 40+. 5-1 cig Aj 
with Cig E ZG{t]. 
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Finally, it may happen that a group G has exponential growth, namely 
that the growth rate limp_.o vg,s(R)V/F is > 1 for all S, but that the 
infimum of these growth rates, over all S$, is 1. Such a group is called 
of non-uniform exponential growth. The question of their existence was 
raised by Gromov [35, Remarque 5.12]. Again, soluble, linear and word- 
hyperbolic groups cannot have non-uniform exponential growth; but, 
again, it turns out that such groups abound. We shall show: 


Theorem (Theorem 8.2). Every countable group may be imbedded in 
a group of non-uniform exponential growth. 

Furthermore, the group W in which the countable group imbeds may 
be required to have the following property: there is a constant K such 
that, for all R > 0, there exists a generating set S of W with 


vw,s(r) < exp(Kr®) for all r € [0, RI. 


(Self-)Similar Groups and Branched Groups 

All the constructions mentioned in the previous subsection take place in 
the universe of (self-)similar groups. Here is a brief description of these 
groups; see §4 for details. 

Just as a self-similar set, in geometry, is a set describable in terms 
of smaller copies of itself, a self-similar group is a group describable in 
terms of ‘smaller’ copies of itself. A self-similar structure on a group 
G is a homomorphism ¢: G > Gx P for a permutation group P 
of X. Thus, elements of G may be recursively written in terms of 
G-decorated permutations of X. For this description to be useful, of 
course, the homomorphism ¢ must satisfy some non-degeneracy condi- 
tion (in particular be injective), and P should be manageable, say finite. 

The fact that the copies of G in G2x P are ‘smaller’ than the original 
is expressed as follows: there is a norm on G such that, for g € G and 
o(g) a permutation with labels (gz : x € X), the elements gz are shorter 
than g, at least as soon as g is long enough. For example, consider G 
finitely generated, and denote by ||- || the word norm on G. One requires 
llgc || < Ilgl| for all « € X and all |lg|| >> 1; this is equivalent to the 
existence of 4 € (0,1) and K > 0 such that |lgz|l < Allg|| + A for all 
crEeX,geG. 

Furthermore, in cases that interest us, the map @ is almost an 
isomorphism, in that its image #(G) has finite index in Gy P. Thus, ¢ 
may be thought of as a virtual isomorphism between G and G* , namely 
an isomorphism between finite-index subgroups. When one endows 
G* with the 2 metric (gr) || = maxzex |lgzrll, the aforementioned 
condition requires that this virtual isomorphism be a contraction. On 
the other hand, endowing G* with the ¢! metric (gx) Il = Yonex Ilgell, 
the optimal Lipschitz constant of the virtual isomorphism plays a 
fundamental role in estimating the growth of G. 
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Similar groups are a natural generalization: one is given a set Q and a 
self-map 0: Q; for each w € Q, a group Gy and a permutation group 
P, of a set X,,; and homomorphisms ¢,: Gy > Gow 2x, Po. Taking for 
Q a singleton recovers the notion of a self-similar group. Taking Q = N 
and o(n) = n+ 1 defines in full generality a similar group Go; but it 
is often more convenient to consider a larger family of groups in which 
(Gn) nen imbeds. In particular, one obtains a topological space of groups, 
in such a manner that close groups have close properties (for example, 
their Cayley graphs coincide on a large ball). 
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Open Problems 

This text presents a snapshot of what is known about growth of groups in 
2014; a large number of open problems remain. Here are some promising 
directions for further research. 


1. Which groups G are such that, for all generating sets S, the 
complete growth series Tes is a rational function of z? 

This is known to hold for virtually abelian groups, and for word- 
hyperbolic groups. Conjecturally, this holds for no other group. 

The related question of which groups have a rational (classical) 
growth function is probably more complicated, see §2.1. 

2. Is the analytic continuation 1 /T¢, g(1) related to the complete Euler 
characteristic of G, just as 1/Pg g(1) is (under some additional 
conditions) the Euler characteristic of G? See [74, §1.8] for complete 
Euler characteristic. 

3. Does there exist a group G with two generating sets 51,52 such 
that iy) $, is rational but iwel ,S2 is transcendental? 

Such an example could be G = F2 x Fo. Set S = {2, y}=! a free 
generating set of Fo, and S; = Sx {1}u{1}x S and $9 = S,U{(s, 5) : 
5 € S}. 

The same properties probably hold for the usual generating series 
Tg.s, and [g_s,. The radius of convergence of [g_g, is 1/3, but 
that of "gg, is unknown. 
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This problem is strongly related to the ‘Matching subsequence 
problem’, which asks for the longest length of a common subse- 
quence among two independently and uniformly chosen words of 
length n over a k-letter alphabet; see [16]. It is easy to see that, for 
two uniformly random reduced words of length n in Fo, the longest 
common subword has length ~ yn for some constant y, as n — od. 
Thus, a pair (g,h) € G with ||g|| = ||Al| = 1 has length 2n with 
respect to 51, but approximately (2 — a@)n with respect to Sg. We 
might call y the Chudtal-Sankoff constant of Fo. 

4. Are there infinite simple groups of subexponential growth?? 

There is no reason for such groups not to exist; but the construc- 
tion methods described in this text yield groups acting on rooted 
trees, which therefore are as far as possible from being simple. 

There is also no reason for finitely presented groups of subexpo- 
nential growth not to exist; again, the obstacle is probably more our 
mathematical limitations than fundamental mathematical reasons. 

The following question, by de la Harpe [38], is still open at the 
time of writing: ‘Do there exist groups with Kazhdan’s property (T) 
and non-uniform exponential growth?’ 

Similarly, it is not known whether there are simple finitely 
generated groups of non-uniform exponential growth, and whether 
there are finitely presented groups of non-uniform exponential 
growth. 

5. Are there groups whose growth function lies strictly between 
polynomials and exp(R!/?)? 

See the discussion in §2.3. There is a superpolynomial function 
f(R & Rive Ry such that no group has growth strictly between 
polynomials and f(R). There is no residually nilpotent group 
whose growth is strictly between polynomials and exp(R!/2), see 
Theorem 5.2. 

6. What is the asymptotic growth of the first Grigorchuk group? 
What is its exact growth, for the generating set {a, b,c, d}? Does 
the growth series of the Grigorchuk group exhibit some kind of 
regularity? 

Some experiments indicate that this must be the case. For 
example, consider the quotient G,, of the first Grigorchuk group that 
acts on {0,1}”. It is a finite group of cardinality 25:2" 842 For n <7, 
the diameter Dy, of its Cayley graph (for the natural generating set 
{a, b,c, d}) is the sequence 1, 4,8, 24, 56,136,344 and satisfies the 
recurrence Dyn = Dn—1 + 2Dn—2 + 4Dn_3. If this pattern went on, 
the growth of the first Grigorchuk group would be asymptotically 
exp( R98 2/log M4) exp(R°-7), 


3 This question has been answered positively by V. Nekrashevych, see 
https://arxiv.org/abs/1601.01033. 
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If a group has subexponential growth, then its growth series is 
either rational or transcendental, and if the group has intermediate 
growth, then the growth series must be transcendental; See 
Theorem 2.1 in §2.1. Thus, Grigorchuk group’s growth series 
is transcendental. Does the series satisfy a functional equation? 
That would make it akin to the classical partition function 
Yaso Pe" =I] = z”)-!, which (up to scaling and 
multiplying by z!/ ae) is a modular function. Ghys asked me 
once: ‘Is the growth series ['(z) of Grigorchuk’s group modular?’ 

7. For every k € N, the space of marked k-generated groups .%, may 
be defined as the space of normal subgroups of the free group Fy, 
by identifying G = (s1,...,5,) with the kernel of the natural map 
Fy, — G sending generator to generator. It is a compact space. 
What properties does the set 4 of groups of intermediate growth, 
and the set .V of groups of non-uniform exponential growth, enjoy 
in this space? For example, 

‘Is there an uncountable open subset of .% in which -/, or -¥, 
is dense? Is .V dense in the complement of groups of polynomial 
growth?’ 

Recall that similar groups are families of groups (G.)weg indexed 
by a space Q. If all G, are k-generated, we obtain a map Q > .%, 
which under favourable circumstances is continuous. This has been 
exploited, for example, in [60] to produce groups of non-uniform 
exponential growth. 

It had actually been doubted, before Grigorchuk’s discovery [29], 
whether there were groups of intermediate growth. This text tries 
to convince the reader that they are abundant. Giving a precise 
meaning to the above question would quantify, in some manner, 
the extent to which they are abundant. 


Notational Conventions 
I try to adhere to standard group-theoretical notation. In particular, the 
right action of a group element g on a point x is written zg, and a left 
action is written 9x. The stabilizer of x is written G,. The conjugation 
action of a group on itself is written g” = h—!gh, and the commutator 
of two elements is [g,h] = g~!h-!gh = g7!g" =h-Gh. 

IT also introduce a minimal amount of new, ‘fancy’ notation to represent 
elements of wreath products or of self-similar groups, and hope that it 
helps in achieving clarity and conciseness. 


4 Te. a function A(z) = A(exp(27it)) such that the corresponding function t 
A(exp(2zit)) on the upper half plane is invariant under a finite-index subgroup of 
SL9(Z). 
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1. Wreath Products 


We start with the basic construction. Let H be a group, and let G be a 
group acting on the right on a set X. We construct two groups: 


Arx G= (11) xG the restricted wreath product, 
x 


ArxyG= (1 #) x G the unrestricted wreath product. 
xX 


Here the unrestricted product [| H may be viewed as the group of 
functions X — H, with pointwise composition and with left G-action 
given by pre-composition?: 9f is the function given by (%f)(x) = f (aq). 
The restricted product ITx H is then identified with finitely supported 
functions X — H. In both cases, this product is a subgroup of the 
wreath product and is called its base group. 

In the particular case of X = G with natural right action by 
multiplication, one calls He@G the regular unrestricted wreath product 
and writes it simply HxG; and similarly for the regular restricted wreath 
product Hig G= HG. 

Assume that the action of G on X is faithful; so that elements of G 
may be identified with permutations of X. The best way to describe 
elements of Hxy G or its subgroup Hx G is by decorated permutations: 
one writes a permutation of X, decorated by elements of H, such as 


ay we NSN 
wr as ee 
‘A 
> 


Permutations are multiplied as usual: by stacking them and pulling 
the arrows tight. Likewise, decorated permutations are multiplied by 
stacking them and multiplying the labels along the composed arrows. 
We do not write the labels when they are the identity. Here is a graphical 
computation of a product: 


s Ss ~N 
wo ee ae Preke tale i ee 
hg 
“ey ce. “ea, 5 ee halk 
St 


When writing formule, we must sometimes depart from the graphical 
notation, in which permutations are written top-to-bottom or left-to- 
right and thanks to their arrows there is no ambiguity in knowing in 


5 Note that the side of the action changes! It is best to always use the appropriate 
side, so as to avoid inverses. Recall, however, that every left action can be converted 


=1 
into a right action by setting f9 := 9 f and vice versa. 
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which order to compose the labels. We invariably let permutations act 
on the right on sets, and thus ‘ot’ means ‘first o, then t’. 


Exercise 1.1. In the wreath product W = {+1}?Sym(2), consider the 
element 
aa 
A= 


Show by concatenating the diagram with itself that h has order exactly 4. 
The group W has order 8; which of the order-8 groups is it? 


Let us consider a wreath product W = Hwy G. In writing elements 
w,w’ € W algebraically, we may express them in the form w = fg with 
f: X > Hand gé G, or in the form w’ = gf. In both cases, the element 
g and the function f are unique, by definition of the semidirect product. 
The compositions (fg)(f’g’) and (gf)(g’f’) are, in all cases, computed 
using the relation 


gif =F +9; 


namely 


(fo) (F'9') = fF f(g’) and (gf (g'f’) = (gg) ff’). (1.2) 


Exercise 1.2. Let R be a ring, viewed as a group under addition, 
and let RG denote the group ring of G, on which G acts by right 
multiplication. Show that R?G is isomorphic to RG x G. More generally, 
let X bea G-set; then RX is a G-module. Show that R?x G and RX xG 
are isomorphic. 


1.1. Actions 


Assume now, moreover, that H acts from the right on a set Y. Then 
there are two natural sets on which W = Ary G acts: 


e There is an action on Y x X, given by (y,2z)- fg = (yf(#), xg) for 
(y,z) € Y x X; it is called the imprimitive action; 

e There is an action on Y*, the set of functions X > Y, given by 
(d- fg\(xg) = O(x) f(x) for 6: X > Y; it is called the primitive 
action. 


Exercise 1.3. There are natural bijections between the sets (Z x Y) x 
X=Zx(Y x X) and (Z)* = ZY**, Assume now that a group G 
acts on X, a group H acts on Y and a group J acts on Z. Show that 
the bijections above give isomorphisms between the groups (ty H)?x G 
and (y,x (H 2x G)) as permutation groups, both of Z x Y x X and 
of (ZY)*, 
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1.2. History 


Leo Kaloujnine understood the importance of wreath products in the 
early 1940s. It is said that he worked, during the Second World War, in 
a uniform factory and observed a rivet machine; it was made of a rotating 
ring containing many rotating disks in it. Identifying the movement of 
the ring with an action of G and each subdisk with an action of H, 
one sees that motions of the machine are described by wreath product 
elements. In his dissertation (under Elie Cartan, [44]), he studied the 
Sylow subgroups of symmetric groups and showed that they were iterated 
wreath products. 

However, this description of maximal p-subgroups of symmetric 
groups already appears in the classical 1870 treatise by Camille Jordan 
[43, II.1.41], who implicitly defined wreath products there. This is all 
the more remarkable as Sylow’s theorems were published only two years 
later [78]! 

Kaloujnine returned to the Soviet Union after the war and contributed 
greatly to the development of mathematics in Ukraine, founding in 1959 
the department of algebra and mathematical logic. He is remembered 
for the following important result classifying group extensions, namely, 
that the wreath product is a universal object containing all extensions: 


Theorem 1.4 (Kaloujnine-Krasner, [45]). Let G, H be groups. Denote 
by w: HeG = G the natural projection. Then the map E + E defines 
a bijection between 


{B:1 >H> ESG > 1} 


isomorphism of extensions 


and 


{EB < HxG:n(B) = G and kerr) NE —> H viace Ho ® c(1)} 


conjugacy of subgroups of HaG 


Exercise 1.5. Prove Theorem 1.4. 

Hint: given an extension 1 > H > E >. (hes 1, choose a set- 
theoretic section® g + JG of t, and define an imbedding E > HxG 
by 


er (gr Ge(gr(o)) 1) (0). 


Theorem 1.4 may be interpreted more abstractly as saying that, if 
1—> H > E => G = 1 is an exact sequence and fo: H > K isa 
group homomorphism, then fo extends naturally to a homomorphism 
f: E > KG. The case f = id and H = K is exactly the statement of 
the theorem, and the generalised version is proven in exactly the same 


6 Namely, a map satisfying t(g) = g for all ge G. 
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manner. The advantage of this formulation is that one need not require 
that H be normal in £; and the more general statement is 


Theorem 1.6. Let H < E be groups, and let fo: H > K be a 
homomorphism. Then fy extends naturally to a homomorphism’ f: E > 
Key gE/core(H). 

More precisely, let G,H be groups, and let G act on the right on a 
set X. Denote by wm: HrG > G the natural projection. Then the map 
Ew E defines a bijection between 


E: H<E and the G-sets X and H\E are isomorphic 
via a homomorphism E > G 


isomorphism E — E" of groups intertwining the actions on H\E and H\E' 
and 
E<HxG: x(E) transitive on X and 
| ker(t) NE —> H viace HX % c(2) for alla eX | 
conjugacy of subgroups of HG 


Note that we could have written F instead of E// core(H) everywhere 
above. We included it to obtain a smaller group acting on the set X, 
thus reinforcing the parallel with Theorem 1.4, and expressing FE as 
a subgroup of a presumably easier-to-construct group. In particular, if 
[EF : H| = d < ow, then E/core(#) is a group of cardinality between 
d and d!, since it acts faithfully and transitively on the set H\E of 
cardinality d. 


Proof. We follow the sketched proof of Theorem 1.4; rather than a 
section G > E, we choose a right transversal T of H in E£; namely, a 
subset T C FE such that every e € EH may uniquely be written in the 
form ht with he H,te T. 

For e € E, let f(e) be the following H-decorated permutation of H\E: 
the permutation is given by the natural right-multiplication action of E 
on H\F, and on every edge, say from Ht to Hu if Ht. e = Hu, the label 
is teu7!. 

In formulas, this may be written as follows, although all verifications 
are easier in the ‘decorated permutations’ form. Denote by z the natural 
map EF > E/core(#). Set then f(e) = cm(e) with c: H\E > H given, 
for t € T, by c(Ht) = teu! for the unique u € T such that teu! € H. 

Composing the map c by the homomorphism fo: H — K gives the 
first statement. 


Wreath products have occupied a prominent place in the theory 
of permutation groups. For illustration, they appear as fundamental 


7 Recall that the core of the subgroup H is the intersection of its conjugates. 
E/ core(H) is the natural permutation group acting faithfully on the left coset 
space H\E. 
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constructions in the O’Nan-Scott theorem classifying maximal sub- 
groups of the symmetric group. We skip subcases (3.iii)... (3. viii), which 
are too technical: 


Theorem 1.7 (O’Nan-Scott, [70]). Let G be a maximal subgroup of 
Sym(X) for a finite set X. Then either 


(1) the action of G is not transitive’; then X = YUZ and G = 
Sym(Y) x Sym(Z); or 

(2) the action of G is transitive, but not primitive? ; then X = Y x Z 
and G=Sym(Y)?Sym(Z) in its imprimitive action; or 

(8) the action of G is primitive; and then either 

(3.1) G is an affine group over a finite field; or 
(3.ii) X = Y% and G = Sym(Y)2Sym(Z) in its primitive action; 
or 


(8.iti) ... (B.viti) ... 


1.3. Generators for Wreath Products 


We now consider restricted wreath products W = Hx G in more detail. 
We introduce the following notation: if X be a set and H be a group, 
then for allz € X andhe H, 


(h@x)(z) =h, 


we define hQ@z: X > H by 
(h@z)(y) = 1 for all y Az. 


One has the important formula relating 
conjugation in W with translation: 


(h@x)9 = h@zg 


TRAIN: 


. ALMOST... 


(check it by drawing the decorated permu- 
tations!). 

Let S be a generating set for G, and let 
T be a generating set for H. Let Y C X 
be a choice of one representative from each 
G-orbit on X. Then W is generated by 


{t@y:teT,ye Y}uS. 


A good, but also slightly misleading 


example, is the ‘lamplighter group’: this 4 MACHINE THAT GRABS THE 
EARTH BY METAL RAILS AND 


is the group W = {+1}2Z. It is best — Rorares rr UNTIL THE PART 
understood by its primitive action (on YOU WANT IS NEAR YOU 


the space of functions {+1}7): imagine an (reproduced from https:// 
infinite street with at each integer position xkcd.com) 


8 Te. there are at least two orbits on X. 
9 Le. there is a non-trivial G-invariant equivalence relation on X. 
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a lamppost. The lamp there can be ‘on’ or ‘off’. Imagine also that the 
street is viewed from the perspective of the ‘lamplighter’, namely the 
person in charge of turning various lamps on and off. The generator s 
of Z means ‘move up the street’, or equivalently, ‘shift all lamps down’ 
relatively to the lamplighter; and the generator {@1 means ‘change the 
state of the lamp currently in front of the lamplighter’. 

This is what is expressed by the cartoon above: relativistically 
speaking, it makes no difference to think that a train moves on its tracks, 
or that the tracks move under the train. The lamplighter is on the train, 
and the lamp configurations are on the track. 

The misleading aspect of this example is that one must remember 
that, when the lamplighter moves in one direction, the lamps actually 
move in the opposite direction. In a regular wreath product, this makes 
no difference, but in the general case it does. 


Definition 1.8. Let G = (S) bea finitely generated group acting on 
the right on a set X. The associated Schreier graph has vertex set X, 
and for each x € X,s € S an edge from & to zs. A 


In the case X = G of a group G acting on itself by (right or left) 
multiplication, one obtains the (right or left) Cayley graph of G. The 
group G acts then by graph isometries on its Cayley graph by (left or 
right) multiplication. 


Exercise 1.9. Consider G = Sym(4) generated by {(1, 2), (2, 3), (3, 4)}. 
Draw the Schreier graphs of (in order of difficulty): 


(1) the action of G on {1, 2,3, 4}; 
(2) the action on the collection of 2-element subsets of {1, 2,3, 4}; 
(3) the action of G on itself by conjugation. 


It would be a great mistake to think that a path in the permutational 
wreath product W = H 2x G is a path in the Schreier graph of G 
with decorations in H along the path. We will see where exactly the 
decorations appear, but their positions are rather at the inverses of path 
points. To complicate matters, in case G is abelian the map tH «~! 
is an automorphism of G, from which the above ‘mistaken’ description 
fortunately gives the correct answer. 

Let us fix generating sets G = (S$) and H = (T), and assume for 
simplicity that G acts transitively on X. We also choose a base point 
z eX. Then W is naturally generated by TQ@zru S. 

We consider first the description of elements of W as fg with 
f: X — H and gé G. This is the ‘lamplighter moving’ version, because 
the ‘lamp vector’ f is not shifted, but the position at which it is changed 
varies. By (1.2), the right action on w = fg of 


—a generator s € S gives ws = f (gs); 
—a generator t@z € T@rx gives w(t@z) = (f - t@zrg!)g. 


GROWTH OF GROUPS AND WREATH PRODUCTS 15 


Thus, if w = (t9@z)s1( @z) --- s¢(te¢@x) = fg, then the support of f is 
included in {x, rs; ",..., 2(s1...8¢)"}. 

We may also describe elements of W as gf; this is the ‘earth moving’ 
version, because the ‘lamp vector’ f is shifted, and the lamplighter always 
changes the lamp at a fixed position. By (1.2), the right action on w = gf 
of 


—a generator s € S gives ws = (qs) ote. 
—a generator tQz € T@z gives w(t@z) = g(f - t@z). 


Thus, if w = (t9@z)s1(t1@z) --- se(t¢@x) = gf, then the support of f is 
included in {xs1... 5g, 782... 5¢,..., Se, Z}. 

The semidirect product description of W = H 2x G gives a presenta- 
tion of W by generators and relations; see [17]. Generating as above W 
by T@Y LS, we get 
W= (T@ Yus | relations of G, relations of H, and 

Vt,t'e T,Vy#y' € Y : [t@y, t'@y’'], and 
Wte T, Vy e Y,Vg € Gy : [t@y,g], and 
Vi, te T, Wye Y,Vg € (Gy\G)\ {Gy}: [t@y, (’@y)9)). 


The exact criterion, assuming X AY and H £1, is: 


Theorem 1.10 (Cornulier [17, Theorem 1.1]). The wreath product W 
is finitely presented if and only if both G,H are finitely presented, G 
acts on X with finitely generated stabilizers, and G acts diagonally on 
Xx X with finitely many orbits. 


Diestel—Leader Graphs 


We present a particularly intuitive description of the Cayley graph of 
‘lamplighter groups’ W = FZ, for a finite group F of cardinality q, 
see [83]. 

Let Y denote the (q+ 1)-regular tree, choose a basepoint o, and a 
geodesic ray w: N—> Y starting at o. Imagine 7 as ‘hanging from @’: 
orient the edges on w from w(n) to w(n + 1), and orient every other 
edge from its furthest point to w to its closest. Define then h: FY > Z 
as follows: for each « € 7, there is a unique path in ZY from o to z, 
and set h(a) = (number of edges oriented forward) — (number of edges 
oriented backward) on this path.!° 

Consider now the following graph B(q, q). Its vertex set is {(a,y) € 
Tx TF > h(r)+h(y) = 0}. There is an edge from (x, y) to (2’, y’) precisely 
if {x, 2'} and {y, y’} are connected in 7. Here is a portion of B(2, 2), with 
one tree pointing up and one tree pointing down: 


10 This is usually called a Busemann function. 
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Proposition 1.11. Consider a wreath product G = FZ with a finite 
group F of cardinality q. Denote by s a generator of Z, and consider for 
G the generating set S = (F@1)sUs—!(F@1). Then the Cayley graph of 
(G,S) is B(q, q). 


Proof. Only in this proof, for a subset A of the integers, let us denote 
by F A the set. of finitely supported functions A > F'’. The vertex set of 
ZF may be identified with L],<z FO" x {n}, in such a manner that 
there is an edge from (0, n) to (0 |(~c0,n—1), 2 — 1) for all n € Z and all 
o € F&I, Vertices on the ray w are those of the form (1,7) with n < 0. 

Therefore, the vertex set of B(q,q) may be identified with 
hee Foon) x FO," x {n}. This is easily put in correspondence 
with FZ x Z via the map (0,0’,n)  (t,n) with t: Z > F the finitely 
supported function given by t(k) = o(k) if k < n and t(k) =o'(1— k) 
ifk>n. 

Thus, we put the vertex set of B(q,q) is bijection with G. It is 
now routine to check that the generators in S produce the edges of 
Bq, y)- 


This description of lamplighter groups makes some of their geometric 
features quite transparent. For example, let us consider a finitely 
generated group G = (S$), and its Cayley graph. A vertex v € G is 
called a dead end if all neighbours of v are at least as close to 1 as v. 
More generally, let us say v is on a k-hill if all paths from v to an element 
of norm ||v|| + 1 has to go through a vertex of norm ||v|| — k. In other 
words, from the top of the hill the only way of going to infinity is to first 
go down at least k steps. 

Consider the two dots on the illustration above. Say one of them is the 
origin 1. Then the other one is on a 1-hill. More generally, any element 
in the lamplighter group that is reached from the origin by going down k 
steps, up 2k steps and down again & steps along a reduced path reaches 
the top of a k-hill. 

This is in fact a familiar phenomenon: consider a long street and two 
remote addresses we want to visit on that street, in whichever order, 
from our starting point on the street. The shortest way of doing this is 
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to first go to the closest, and then the other one. The worst possible 
place to start is at equal distance from both addresses. 


2. Growth of Groups 


This section is not a treatise on growth of groups; for that, see rather 

[53]. We do recall some elementary, basic notions, and provide some 

motivation for the material to appear later. Let G be a group generated 

by a finite symmetric set (S = S~'). One defines the word norm on G by 
Ilgll = min{n : g = s1--+ Sn, 8 € S}, 

"1 on G that is invariant under left translation!?: 


d(g,h) = |g hl. 


and a distance 


Thus, G is viewed as a normed space and as a metric space on which 
G acts by isometries via left translation. 


2.1. Formal Growth 


One may be interested in regularity properties of the metric space G; 
these are best studied via the growth series, the formal power series 


Pg(z)= > zl! € Zz]. 


géG 


The natural questions that arise are: What is the domain of con- 
vergence of [g@(z)? What can be said of analytic continuations of 
[q@(z)? What are its singularities? Is the function g¢(z) rational (i.e. in 
Q(z))? or at least algebraic (i.e. there exists a two-variable polynomial 
F(y,z) € Zly, z] with F(T g(z), z) = 0)? 

The consideration of the power series [q@(z), and of the above 
questions, is justified by the answers that have been given: 


e I'g(z) converges in a disk of radius at least 1/#S. If S is symmetric, 
then the convergence radius is in fact at least 1/G/S — 1), with 
equality if and only G is a free product of Zs and Cos with its 
natural generating set, as in Definition 2.9 below. 

e If groups G, H are respectively generated by S, T, then the direct 
product G x H is naturally generated by SU T. One then has 


Yexa(z) =Te@razt), 


see Proposition 2.4. 


11 Tt is only here that we use the fact that S is symmetric, to obtain d(g, h) = d(h, g); 
in fact, it suffices in all that follows to assume that S generates G as a monoid. 
12 Le. d(tg, th) = d(g,h). 
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e If groups G,H are respectively generated by S, 7, then the free 
product G * H is naturally generated by SU T. One then has 


1 1 1 
— + 1, 
VouH(2) Vet) Trt) 


see Proposition 2.5. 


e If the group G is virtually abelian [9], or word-hyperbolic [36], or the 
1ZZ 
discrete Heisenberg group [18] H3 = ( 1 z), then Tq is a rational 


function of z for all choices of the finite generating set S. 

e Wreath products give some examples of power series 'g¢ that are 
algebraic functions, as we shall see in Corollary 3.3 below. 

e If G is a two-step nilpotent group with cyclic derived subgroup, 
then there are generating sets for G such that Tg is a rational 
function of z. However, if G is the five-dimensional Heisenberg group 


1 : ZZ : 
Hs = ( 1 : A} then there are generating sets for which '¢(z) 


is transcendental; see [75]. 


In respect to this last point, note that, for G nilpotent, the growth of 
G is polynomial so [ ¢(z) converges in the unit disk. It is either rational 
or transcendental, by the Fatou theorem [21]. More is known: 


Theorem 2.1 (Pdlya-Carlson [14]). Let A(z) = Yinso anz” be a 
power series with integer coefficients. If A is not rational, then A does 
not extend analytically beyond the unit circle. 


Most importantly, the growth series is a convenient object that encodes 
information on G. A quite satisfactory theory of ‘Euler characteristic’ 
has been developed for groups, see [15]. Here is a special case: if G is the 
fundamental group of a cellular complex 2 with contractible universal 
cover, one declares x(G) to be x(2). More generally, if G has a finite- 
index subgroup H, which is the fundamental group of a space Y, one sets 
x(G) = x(Y%)/|G : H]; this makes sense because if G is the fundamental 
group of 2, then H is the fundamental group of a [G : H]-sheeted 
covering of 2°, whose Euler characteristic is |G : H]x (2). In particular, 
if G is finite, then x¥(G) = 1/#G. This led to the idea that 1/T' g(z) 
could behave like a Euler characteristic, and that its limit 1/T @(1) could 
express x (G). This is not always true, but it does hold in some illustrative 
cases. 

Let us compute, for instance, the growth series of a free group Fy 
generated by a basis.!° It follows from the formula for free products, or 
by direct counting if one notes, for all € > 1, that there are 2k(2k —1)*"! 
elements of norm ¢ in Fy, that 


Ble S= {x1, a, ac igi hes a} and Fy, may be identified with reduced words over S. 
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1+z 
1—(2k-1)z 
The value Pp, (1) is uniquely defined by analytic continuation, and one 
has 1/[r,(_) = 1—k, in agreement with Fy being the fundamental 
group of a graph with 1 vertex and k edges. See [23, 32, 51, 73] for more 
such examples of the ‘1/l¢@(1) = x(G)’ phenomenon. 


Pp, (2) = (1.3) 


2.2. Complete Growth Series 


There exists a stronger property than having a rational growth series: 
a group G = (S) has a rational geodesic combing if there is a finite 
directed graph with edge labels in S and a fixed ‘initial’ vertex, such 
that the set L C $* of words read from the initial vertex along paths 
in the graph has the following property: ZL maps bijectively to G by the 
natural evaluation map of words as elements of G, and the words in LD 
have minimal length among all words in S* having the same evaluation 
in G. Kervaire suggested considering the complete growth series 


Pez) = > gz! e ZG[I[z]]. 
geEG 


Note that wel depends on the choice of generating set S, even though 
we do not mention it explicitly. The series Te(z) iS a power series 
with coefficients in the group ring, and one may again ask whether 
it is rational or algebraic. Because ZG need not be commutative, let 
us define more precisely these notions; we refer to [68] for details. Let 
A CA be rings. An algebraic system over A in variables X1,...,Xn isa 
non-degenerate!4 n-tuple of polynomials P1,..., Py in non-commuting 
indeterminates X1,...,Xn and coefficients in A. In a linear system 
over A, the polynomials are restricted to have degree 1 and contain 
the indeterminate on the right; i.e. the P; are sums of monomials all 
belonging to AUU);<;<, AX;. A solution is an n-tuple (f,,.-.,fn) € A” 
such that all P;(ft,...,fn.) = fj for alli =1,...,n. 

We then say that a power series F(z) € ZG|[z]| is rational, respectively 
algebraic, if it is the first coordinate of the solution of a linear, 
respectively algebraic system over the polynomial ring ZG[z]. A more 
direct definition of the ring of rational functions is that it is the smallest 
subring of ZG|[z]| containing ZG[z] and closed under Kleene’s quasi- 
inversion, the operation F(z)* = (1— F(z))7! =14+ F(z) + F(z)? 4+:-- 
defined for all F(z) € ZG|[z]|| with F(O) = 0. 


Exercise 2.2. If G admits a rational geodesic combing, then its growth 
series is rational. 


14 Let us not detail this too much; suffice it to say that the system must have a unique 
solution once its initial terms f,(0),..., fn(0) have been fixed. 
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Hint: define one variable X; for each vertex 7 of the graph defining the 
combing, and encode the edges of the graph into polynomials. 


Exercise 2.3. If the growth series of G is rational, then G admits a 
quasi-geodesic combing: a language L Cc $*, recognised by a finite graph 
as above, with the property that, for some constant C' € N, all words 
51... 8¢ € L have the property € < C||s1--- sell. 

Hint: consider a polynomial system defining Te, s(z), make sure that 
every term in ZG is accompanied by at least one factor z. Write the terms 
in ZG as linear combinations of words over S$, yielding a polynomial 
system over ZS*. Let C be the maximal length of all these words over 
S that appear in the polynomial system. A solution to the polynomial 
system will be a sum of monomials of the form s,...s¢z", where n = 
I|s1--+ sel] = €/C. 


These notions strengthen the ones for the classical growth series: if 
Te(z) is rational or algebraic, then its image under the augmentation 
map ZG —» Z is rational or algebraic. On the other hand, statements 
concerning the complete growth series are usually not much harder to 
prove than the analogous ones concerning the classical growth series: 


Proposition 2.4. Let the groups G,H and G x H be respectively 
generated by S,T and SUT. One then has 


Poxu(z) =Pe)Pa(z). 


Proof. Every element (g,h) € G x H satisfies ||(g,h)|| = Ilg|] + || All; so 


Texn() = ss ghzlgll+ IA = ys gzl'al ye hzlhll =Fo(z)Py(z). 
(g,h)eGx A geG heH 


Proposition 2.5. Let the groups G,H and G * H be respectively 
generated by S,T and SUT. One then has 


ieaeenes Smeg 
Tow(z) Te) Tr) 


Proof. Every element of w € G«* H may be uniquely written in the 
form w = hog hy --- ge with ho € A, hy,...,he_-1 € A\ {1}, 1,---, 9¢-1 € 
G\ {1}, g € G. Thus, 


Posn(2) = > Ta()(Pel) — DP x) - D)'Tet) 


20 
ax 1 
=Vy_() 


Pea); 
i-fe@-Dee@—-p °” 
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sO 


1 ES ae x a 
fas) = Foam | 6 + Ty (2) - Pelee 
1 1 


a oo 
Ty(z) Te) 
These results are generalized to graph products in [3]: given a graph T 


with vertex set V and a group Gy for each v € V, the graph product of 
the Gy is 


Gp := & Gr [(IGo, Gw| for each edge (v, w)). 


Recall that a clique in a graph is a subset of the vertices, any two of 
which are connected by an edge. They show: 


Proposition 2.6 ({3, Theorem 3.8]). Let each group Gy have generat- 
ing set Sy, and consider the generating set L),ey Suv of Gr. Then 


a » M (x -1); 


clique WCV veW le, 


There are few classes of groups in which Petz) is rational for all choices 
of generating set: 


Proposition 2.7 (Liardet [52]). If G is virtually abelian, then T g(z) 
is rational for all choices of generating set. 


Proposition 2.8 (Grigorchuk—Nagnibeda [33]). If G is word-hyper- 
bolic, then @(z) is rational for all choices of generating set. 


Note that there is no need to consider rings such as ZG; the definition 
is more naturally phrased in terms of a semiring such as NG. The 
polynomials P; are restricted to be sums of products of monomials, and 
no subtraction is allowed. The notions of Z-rationality and N-rationality 
differ subtly, see e.g. [10]. 

Let us now compute explicitly the complete growth series of a free 
group. It simplifies a little the notation to answer a slightly more general 
question. We denote throughout the text the cyclic group of order p 
by Cp: 


Definition 2.9. A free-like group is a finite-free product of Zs and Cos. 

Say that a free-like group G has m, factors isomorphic to Cy and m2 
factors isomorphic to Z; then it has a symmetric generating set of size 
my + 2mg, consisting of one generator for each Cp and a generator and 
its inverse for each Z. The group G is characterised by the property 
that its Cayley graph (see after Definition 1.8) is an (m1 + 2m2)-regular 
undirected tree. We call such an S$ a natural generating set for G. A 
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For instance, the free group Fm/2 is free-like for m even, and *” C3 is 
free-like. 

Let G be free-like, and let S denote a natural generating set of G with 
cardinality m. We shall see that G has rational complete growth series. 
For ease of notation, we write 3 for s~!. We identify elements of G with 
reduced words over S, i.e. words not containing consecutive ss. 

For all s € S, define F's € ZG|[z]| by 


a Se 


weG, not 
starting with $ 


so Pg(z) = 1+ seg 82Fs. We have the linear system in non- 
commutative unknowns F 


Fe=1+ >) teFi, 
teS,t#s 


with solution F, = (1 — 32) (1— Yyeg tz + (m — 1)z”) *, 80 finally 


[= 2" 
1— YD yeg 82 + (m= 12?’ 


Te(z) = (1.4) 


compare with (1.3). 


2.3. Asymptotic Growth 


We return to a group G with generating set S, and view it as a metric 
space for the word metric. We consider the volume growth of balls in the 
metric space (G, d); this is the growth function vg: R+ > N given by 


vg(R) = lg € G: IIgll < R}. 


It is naturally related to the formal power series '(z): indeed vg(R) is 
the sum of all coefficients of T(z) of degree < R; equivalently, for R ¢ N 
it is the degree-R coefficient of [(z)/(1 — z). Thus, by Tauberian and 
Abelian theorems (see e.g. [59]), asymptotics of ug(R) as R > co may 
be related to asymptotics of [@(z) as z > the convergence radius. In 
particular, the function vg(R) grows as R4 if and only if [ g(z) converges 
in the unit disk and has an order-d pole singularity at 1. 

The norm || - || depends on the choice of generating set S, but only 
mildly: different choices of generating sets give equivalent norms, and 
equivalent metrics. If for v,w: Ry — N we write v X w to mean that 
uv(R) < w(CR) for a constant C € R+ and all R > 0, and we write 
v~w to mean v X w X v, then 


Lemma 2.10. The ~-equivalence class of vg is independent of the 
choice of generating set. 
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Proof. Let S,S’ be two finite generating sets for G, and let us 
temporarily write ||g|| 5, llglls’ and vg,s, vq,s’ for the norms and growth 
functions with respect to S,S’. There exists then a constant C' € N such 
that ||s’\|g < C for all s’ € S’, and thus |lgllgy < Cllgllg. This gives 
ug,s(R) < va,s'(CR). The reverse inequality holds by symmetry. 


Note, as a consequence, that all exponentially growing functions are 
equivalent, and that R@? and C- R4 are equivalent as soon as d > 0. The 
exponential growth rate 


4,5 = limvg,g(R)/® (1.5) 


is nevertheless worthy of consideration, and will be discussed in 88. 


2.4. History 


Interest in asymptotic growth of groups dates back at least to the early 
1950s, in the works of Krause [50], Efremovich [19] and Svarc [76]; they 
were seeking coarse invariants of manifolds based on their fundamental 
group. Milnor noted in [57] that, if G is the fundamental group of a 
compact riemannian manifold .@, then vg is equivalent to the volume 
growth of balls in the universal cover of 74. 

Here is a schematic of the known equivalence classes of growth 
functions of groups. Note the two dots for the two groups of order 4, 
respectively the two groups Z‘ and the Heisenberg group H3 with quartic 
growth: 


aa = 7? Whe 
1 © OZ: BA Rey 
ca ae gap 227 : <P> 
12345 ae Umin exp(R°76-- exp(R) 


The left of the graph is occupied by finite groups; the growth of a finite 
group is equivalent to the constant function taking value the order of the 
group. Abelian groups, and more generally virtually!> nilpotent groups 
have polynomial growth of type R¢ for an integer d. The converse is a 
deep result by Gromov: 


Theorem 2.11 (Gromov [34]). A finitely generated group has 
growth function bounded by a polynomial if and only if it is virtually 
nilpotent. 


It follows also from Gromov’s argument that there is a superpoly- 
nomial function vmin(R) such that all groups with growth % umin are 


15 a property is said to virtually hold if it holds for a finite-index subgroup. 
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virtually nilpotent; so there are no functions with growth strictly between 
polynomial and vpin. Explicit estimates in [72] imply that one may take 
Umin(R) = Rs ew. although the gap is probably larger. Note also 
that there need not exist a -largest function Umin. If one restricts oneself 
to virtually residually!® nilpotent groups, then the gap extends at least to 
exp(R!/2), see [31]; if one restricts oneself to virtually residually solvable 
groups, then the gap extends at least to exp(R!/®), see [82]. 

Milnor asked in 1968, in a famous problem in the ‘American Math 
Monthly’ [58], whether there are groups whose growth function is neither 
polynomial nor exponential. He also conjectured in that note that 
groups of polynomial growth are precisely the virtually nilpotent groups. 
Milnor and Wolf showed in [56, 84] that virtually solvable groups have 
either polynomial or exponential growth, and the inexistence of groups 
with growth between polynomial and exponential became known as 
‘Wolf’s conjecture’. Recall the celebrated ‘Tits’ alternative’ [79]: a finitely 
generated subgroup of a linear group in characteristic 0 either is virtually 
solvable or contains a non-abelian free subgroup; from this it follows 
that linear groups always have polynomial or exponential growth (see 
furthermore §8). 

However, groups of intermediate growth exist, and Grigorchuk [29] 
gave such an example, known as the first Grigorchuk group Go12; see §4. 

The growth of Go12 is not known, even up to ~-equivalence; conjec- 
turally, it is the same as the growth of the group Wo12(C2), which will 
be introduced in 85. The hatched region above indicates that, in fact, 
there are many groups of intermediate growth, and that any ‘reasonable’ 
function between exp(R°6--) and exp(R) is equivalent to the growth 
function of a group; see Theorem 6.2. 

There are at least two arguments for considering asymptotic growth 
rather than exact growth of groups. Firstly, the asymptotics of the 
growth function does not depend on the generating set, by Lemma 2.10, 
so is an invariant of the group itself. Secondly, we expect ‘most’ growth 
series to be transcendental power series, so that they are probably 
difficult to describe, manipulate or expand; this happens, for example, 
for groups of subexponential growth, whose growth series converges in 
the unit disk so is either rational or transcendental, by Fatou’s theorem 
(see Theorem 2.1). 


3. Growth of Regular Wreath Products 
We consider in this section a wreath product W = H?2x G, and compute 


its growth series. We assume that generating sets $,7 for G,H 


16 A property is said to hold residually if for every non-trivial element there exists a 
quotient in which this element remains non-trivial and the property holds. 
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respectively have been chosen, and that the growth series of G and H 
are known. 


3.1. Wreath Products Over Finite Sets 


As a first step, let us suppose that the set X is finite, say X = {2},..., 
ta}. Then, as generating set for W, we may take {tQ@z:teT,x € X} 
ul S. For this generating set, we have 


wz) =la(z)**TeQ), 


and by a small abuse of notation the same relation on complete growth 
series: 
~~ d ~ — 
Pw) =|] [Pa@en | Te. 
i=1 

Indeed, every element w € W may uniquely be written in the form 
w = (hi Q@za1)--- (hg@zq)g for some hj,...,hg € H,g € G, and the 
growth series of {h;@az;: hj € H} naturally coincides with that of H. In 
particular, if [g(z) and I'y(z) are rational, then so is Pw(z). 

Johnson obtained in [42] the same conclusion for more complicated 
generating sets of W. 


3.2. Lamplighter Groups 


The next case we consider is G = X = Z, and in particular ‘lamplighter 
groups’. Because the computations will be generalised in the next section, 
we content ourselves with a brief description of the growth series, and for 
simplicity assume that His a finite group. We consider W = H 2 Z, denote 
a generator of Z by s, and let W be generated by the set {s, s~!}UH@1. 

Consider an element w € W. If its image under the natural map 
W — Z is nonnegative, then it may be written minimally in the 
form s~™(ho@1)s(hy@1)--- s(hp@1)s—” with hy € H, m,n > O and 
p = m+n, while if its image in Z is negative, then it may be 
written minimally in the form s™(ho@1)s~!(hy@1)--- s~}(hp@1)s” with 
p> m+n. Furthermore, ho must be non-trivial unless m = 0, and hp 
must be non-trivial unless n = 0. 

All of these constraints are local and therefore rational, except the 
long-range relation between m,n,p. However, in terms of comput- 
ing growth series, the letters in the expression s~™(hg@1)s(h,@1)--- 
8(hp@1)s~” can be permuted at no cost; and the set of expressions of 
the form 


(hp@1)s~!s(hy@1)--- s~1s(hm@1)s(Am41@1) 
+++ 8(hp—n@1) 887} (hp_n41@D) - + s8~1 (hp @1) 


is indeed a rational language, so that its growth function is rational. 
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Exercise 3.1. Compute the growth series of C2? Z with the standard 
generators C)@1 U {s,s~!}. Note that the growth function grows 
exponentially, at the same rate as Fibonacci numbers. Could you have 
guessed the appearance of Fibonacci numbers without going through the 
calculations? 


3.3. Regular Wreath Products with Free Groups 


We compute in this subsection the complete growth series of a wreath 
product of the form W = H 2G for G a free group. In fact, we suppose 
more generally that G is free-like, see Definition 2.9, so that its Cayley 
graph for the generating set S is an m-regular tree 7. We keep the 
convention of writing 3 for s~! € S$. We suppose as usual that W is 
generated by T@1uU S. 

Consider w € W, written as w = (hg@1)91(h1@1)--- ge(he@1) with 
gi € Gand h; € H. Following the arguments in 81.3, one may write it as 


e 
w=[[i@e)-g--- ge, with eg = (gi--- gi). 
i=0 

The support of w is the subgraph of 7 traced by inverses of prefixes 
of the word g ... gg; it is the convex hull of {e9,..., eg} in 7. We shall 
count elements of W by examining their possible supports and summing 
over them. 

For each s € S, let ©; denote the set of finite subtrees of 7, containing 
1 and no element of S' except possibly s. Each 0 € Os has outer vertices, 
with at most one neighbour in 6, and inner vertices, with at least two 
neighbours in 9. We introduce non-commutative power series F,(, y, z) 
with coefficients in ZG[z], which count the number of Eulerian cycles!” 
in trees Os, weighted by length in z; the variables x, y belong to G and 
in particular are not assumed central. The series F(z, y, z) are defined 
by the algebraic system 


Bs(a,y,2) =1+4 sysz2+sx{ |] Ei(a,y,)-1] 327. (1.6) 
teS ,tAs 


The monomials in £, are in bijection with (Eulerian cycles tracing) 
trees in ©,; if a tree 0 with p edges has inner vertices at fi,..., fn and 


17 Le, cycles that traverse each edge once. 
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non-trivial outer vertices at fj,...,f’,, then the monomial corresponding 
to it is the product, in some order, of 22? gh adn yt. ytn’, 
Indeed the equation defining FE says that a monomial counted by Es 
is either the empty tree (counted as 1) or a single edge from 1 to s 
(counted as sy5z*), or an edge from 1 to s, followed by m— 1 subtrees 
counted recursively by FE; for all t € 5, which are not all empty. Note 
that if 6 has p edges, then a minimal closed path that explores all vertices 
of 6 has length 2p. 

Let D(z) denote the sum of wz!!! over all elements w € W which 
belong to the base group H © and whose support is an element of Os. 
Such elements may be counted as follows: starting from a support 6 € Og, 
choose a word gi... g¢ in G of minimal length that visits all vertices of 
8; and, each time a vertex is first visited, insert an element of H, which 
furthermore must be non-trivial if the vertex is outer. Therefore, 


Ds(z) = EsU (2), Px(2) — 1,2). (1.7) 


Next, let F,(z) denote the sum of wz!”'! over all elements w = fg ¢ W 
with f: G— H, g € G not beginning in 5, and whose support does not 
contain s. We have a linear system 


F(z) =[[ Di@+ >>| [] Du ] eh, (1.8) 
t#3 t4#s \uAts 


because in every such element either g = 1 and the support explores all 
the neighbours ¢ of 1 except 5, or g begins by a generator, say t, and 
then its support explores all neighbours of 1 except s,t, then moves to 
t, and continues by an element not starting by ¢. Finally, 


Pw) =[[ D+ >) | [] De® | szFe(), (1.9) 
ses seS \t#s 


for the same reasoning as above. Combining Equations (1.6-1.9), we 
deduce: 


Theorem 3.2. If H is a finitely generated group whose complete growth 
series [y(z) is algebraic, and G is a free-like group, then the complete 
growth series of W is also algebraic. 


Corollary 3.3 (Parry, [65]). If H is a finitely generated group whose 
growth series Ty(z) is algebraic, and G is a free-like group, then the 
growth series of W is also algebraic. 

If furthermore T(z) is rational and m < 2, then Tw(z) ts also 
rational. 

On the other hand, if m > 3 then Tw (z) does not belong to the field 
generated by z and Ty (z). 
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Proof. Apply the augmentation map w: g +> 1 to Equations (1.61.9); 
this gives an algebraic system of degree max(m — 1, 1) expressing I w (z) 
in terms of z and I'y(z). In particular, for m = 2 it is a linear system. 
Conversely, assume m > 3 and let p denote the convergence radius of 
the series of the image of D, under w. Note that lim,_,,- @(Ds)(z) 
is finite: if the limit were infinite, convergence to infinity would be 
order (m — 1)x itself, a contradiction. Therefore, @(D;) has a non-pole 
singularity, so is not in Q(z, y(z)). 


Exercise 3.4. Show that, for the lamplighter group C22Z, the complete 
growth series is not rational. 
Hint: use Exercise 2.3. 


3.4. Travelling Salesmen 


To glimpse at the limit of what can be computed, consider now the case 
G = Z?. No property of P w(z) is known, and this is due to the fact that 
there is no good description of words of minimal norm describing group 
elements. 

In fact, the problem can be quite precisely stated as follows. One 
is given a point poo and a set {pj,...,pe} in Z?, and is required to 
find a walk of minimal length on the grid that starts at (0,0), visits 
all the points pj,...,pe in some order, and ends at poo. This is a 
classical travelling salesman problem and is known to be NP-complete, 
see [24, 25]. It is a small step to venture that finding a good description 
of minimal paths is at least as hard as finding those paths’ length. 


3.5. Asymptotic Growth 


Regular wreath products, in non-degenerate cases, all have exponential 
growth. This is in stark contrast to the case of permutational wreath 
products, as we will see in §5. 


Proposition 3.5. Jf H £1 and G is infinite, then W = H2G has 
exponential growth. 


Proof. Choose h 4 1 € H, and without loss of generality assume 
that h is a generator of H. Since G is infinite, there exists an infinite 
word gig2... that traces a geodesic in the Cayley graph (see after 1.8) 
of G, with gj, 92,... generators of G and also of W. In particular, all 
(g,--- gi)? are distinct. Consider then, for any @ € N, the set of elements 


{(h@1)© gi (AQ1)* -- - ge(h@L™ : €0,...,€¢ € {0, H}}. 


All of these elements have norm at most 2@ +1, and there are 2+! such 
elements. They are all distinct, since when they are rewritten in the form 
for... ge one has f((g1---gi)~1) = h® so that the €; can be recovered 
from the element. Therefore, vy (2@ + 1) > 21. 
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4. (Self-)similar Groups 


We begin by introducing self-similar groups. They are groups with an 
additional structure: 


Definition 4.1. <A group G is self-similar if it is endowed with a 
homomorphism ¢: G > G?x Sym(X) for some set X. The map ¢ is 
called the wreath recursion of G. A 


In this text, we shall always assume that the set X is finite, and 
shall (unless stated otherwise) also assume that the homomorphism @ 
is injective. A self-similar group is a group G in which elements may 
be recursively described by a permutation of X, decorated by elements 
of G itself. In case X = {0,1,...,d — 1}, we also write elements of 
Grex Sym(X) in the form ((g0,..., g@—1))a for group elements go,..., Gd—1 
and a permutation zm € Sym(X). 

It is essential to understand that being self-similar is an attribute of a 
group, and not a property. Thus, for example, a topological group is a 
group endowed with a topology; and every group is a topological group, 
for the discrete and the coarse topology. In the same vein, every group 
is self-similar, merely for the reason that it is similar to itself. Taking 
X = {0} and $(g) = ((g)) is uninteresting but not illegal. 


4.1. Finite-State Self-Similar Groups 


We describe two fundamental constructions of self-similar groups. 

For the first, start by a well-understood group F, such as a free group; 
and choose a (not necessarily injective!) homomorphism ¢: F > F vx 
Sym(X). There exists then a maximal quotient of F on which the map d 
induces an injective wreath recursion. To wit, one defines an increasing 
sequence N; of normal subgroups of F’ by 


No=1, Msi =o (NS), (1.10) 


and sets G = F/\(); Ni. By construction, the map @ induces an injective 
map ¢: G—> Gx Sym(X). 

An important example of a group defined by this method—and which, 
essentially, cannot be defined differently—is the first Grigorchuk group, 
introduced in [28] and based on [2]. Consider 


F = (a,b,c,d | a”, b?, c?, d?, bed) = Cy * (Co x CQ), (1.11) 
and define ¢: F > F 2 Sym(2) by 


ga= XX, bb) =a le, GO) = la la, b=] Ip. 


It is straightforward to see that ¢ is a homomorphism—just compute 
the images of the relators. It is, however, remarkable that one may 
compute efficiently in G just using this description. We use the same 
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letters a, 6, c, d for the corresponding generators of G. As an illustration, 
let us check that the relation (ad)* holds in G. Writing the permutation 
diagrams horizontally, one has 


b b b b b? 
een =O) OS >< YY 


so (ad)4 = 1 in G because ¢ is injective. 


Exercise 4.2. Using similar calculations, compute the exponent of ab 
and ac in G. 


Note that in that example, G = (S) for the set S = {1,a, b,c, d}, 
which has the property that (S$) is contained in S x S x Sym(2). More 
generally, 


Definition 4.3. Let G be a self-similar group. A subset S$ C G is 
state-closed if @(S) is contained in Sx x Sym(X). 

An element g € G is finite-state if there exists a state-closed subset 
of G containing g. A subset of G is finite-state if all its elements are 
finite-state. A 


Exercise 4.4. Let Z = (t) be endowed with the self-similar structure 
o(t) = Se . Show that only ¢° is finite-state. 


Lemma 4.5. The product and inverse of finite-state elements is again 
finite-state. 


Proof. If g,h are finite-state contained respectively in state-closed sets 
S,T, then gh! is finite-state, as it belongs to the finite state-closed set 
oe mee 


Therefore, a finitely generated self-similar group G is finite-state if 
and only if its generators are finite-state, and one may assume that G is 
generated by a state-closed set. 

In that case, the wreath recursion of G may conveniently be repre- 
sented by an automaton, more precisely a Mealy automaton. This is a 
directed graph with vertex set S called its states, and with an edge from 
s€S tot € S, with label ‘z|y’, whenever the decorated permutation 
o(s) maps z € X to y € X and has label t on the edge > y. Thus, in 
a sense, the graph is the dual of the permutation diagram, with the roles 
of X and S exchanged. The automaton generating the first Grigorchuk 
is, with the convention X = {0,1}. 
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va, eee 


Rew) 
ES, 


Assume that the self-similar group was obtained as above as a quotient 
of a self-similar group F with a map ¢: F > Fy Sym(X). Recall 
that the word problem asks, given a word in the generators of a finitely 
generated group, to determine whether the group element that it defines 
is trivial. There are groups, even finitely presented, in which the word 
problem is unsolvable [62]; however, 


Qo. 1|1 


Lemma 4.6. Let F be a finite-state finitely generated self-similar group 
with wreath recursion @ and solvable word problem, and G be the maximal 
quotient of F on which the induced wreath recursion G > Gx Sym(X) 
is injective. Then G also has a solvable word problem. 


Proof. Assume without loss of generality that F is generated by 
the finite state-closed set S, and denote also by S the corresponding 
generating set of G. Given a word w € S* of length @, it defines a state- 
closed element of F, belonging to the state-closed set S¢. Consider the 
corresponding automaton with vertex set $°. 

Let U ¢ S* denote the set of states that are reachable from w € Sé 
by arbitrarily long paths. This set is computable: set Uj := {w}, and for 
i> 0 set Uj,, := U;U {endpoints of edges starting in Uj}; then the U; 
form an increasing sequence of subsets of S$“, hence stabilize, say to ul. : 
Note that Uj is the set of states reachable from w. For all i > 0, let 
Ui denote those endpoints of edges starting in U/’; then the U;’ form 
a decreasing sequence of subsets of S$“, hence stabilize, to U. 

The element of G defined by w is trivial in G if and only if both of 
the edges starting in Uj have labels of the form ‘z|z’ for some x € X, 
and all elements of U define trivial elements of F’ under the evaluation 
map S* > F. 

More precisely, let m € N be minimal such that every element of 
U may be reached from w by a path of length at most m. Then the 
conditions above imply that w belongs to the normal subgroup Nm, of 
F, see (1.10). 
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The above proof amounts to constructing a Mealy automaton for the 
action of $*, and examining it to determine which of its states are trivial 
in G. 


4.2. Linear Groups 


Here is another construction of self-similar groups. Consider a group G, 
a subgroup H, and a homomorphism ¢9: H — G. By the ‘permutational 
Kaloujnine—Krasner theorem’ 1.6, there is a natural extension ¢: G > 
Gx Sym(X), with X = H\G, in such a manner that @(h) = 
(...,@0(h),...))... for all h € H, with the ‘go(h)’ in position H € H\G. 

Alternatively, this map @ may be directly constructed as follows: 
choose a transversal T of H in G, namely a subset T C G such that 
every g € G may uniquely be written in the form ht with he H,te T. 
Identify X with T. Let then ¢(g) be the decorated permutation that 
sends t € T to u € T with label ¢o(tgu-!) whenever tgu~! belongs 
to A. 

Here is a fundamental example: choose a prime number p, and consider 


G=To(p) = (2 2) N SLe(Z). 


Consider also the matrix ® = G ot) € SL2(Q), and H = GN GG?” Set 


¢o(h) = h®. This example generalises naturally to G any matrix group, 
such as for instance a congruence subgroup of SLy,(Z) for arbitrary n, 
or even SL,,(Z) itself. This shows that the class of linear groups over Z 
is contained in the class of self-similar groups. 

In fact, the only essential ingredient of the above construction is the 
element ® in the commensurator of G. Recall that, for a subgroup G 
of a group L, the commensurator of G is the subgroup of those x € D 
such that GN G* has finite index in G and in G”. If G is an irreducible 
lattice in a Lie group G, then G may be called arithmetic [54] if its 
commensurator is dense in LD; e.g. the commensurator of SL,(Z) in 
SL, (R) is SL, (Q). Then all arithmetic lattices admit self-similar actions 
on rooted trees [46]. 


4.3. Rooted Trees 


Let X be a set, and consider the associated rooted regular tree 7: its 
vertex set is X* = {xj...21 : a] € X} = L]jx X', and it has an edge 
between 2j412;...2, and a... 2, for all 2; € X. The tree is rooted at 
the empty word, the unique element of X°; the set of vertices at distance 
i from the root is identified with X*, and the Cartesian product X° is 
naturally interpreted as the boundary 07 of the tree, namely the set of 
infinite rays emanating from the root. Here for illustration is the top of 
the binary tree: 


GROWTH OF GROUPS AND WREATH PRODUCTS 33 
ye io a a 


00 10 01 11 


000 =100 010 110 O01 101 O11 111 


Let W denote the isometry group of 7, namely the set of bijections 
of X* that fix the root © and preserve the edge structure of 7. Given 
g € W, let o € Sym(X) denote the action of g on X = X!) and for all 
x € X define an element g; € W by (ay --- a, X)g = (Sy °-- 21) Gz (2°); 
namely, gz describes the action of g on the subtree X*z as it is carried 
to X*x° by g. 


Lemma 4.7. The map 


W > Wisym(X) 


vi g > (gr :2€E X)o 


is @ group isomorphism. 


Proof. Given gr € W and o e€ Sym(X), an element g ¢ W may 
uniquely be defined by (ap---212)g = (2p -++21)gz («°). This proves 
that @ is bijective. 

To see that ¢ is a homomorphism, consider the #X subtrees below 
the root. They are permuted according to the permutation part o of ¢, 
and simultaneously acted upon by the decorations g;. Composition of 
decorated permutations therefore coincides with composition of tree 
isometries. 


Let us now start with a self-similar group G. Its wreath recursion 
o: G=> Gry Sym(X) then defines an action of G on X. Furthermore, 
the wreath recursion can be ‘iterated’: one has maps 


ats Gypsy 25 (Geese Sem 
= G2y2 Sym(X) 2x Sym(X)) — --- 


so that G acts on X* for all i € N. Furthermore, these actions are com- 
patible with each other, in the sense that the map (241, 2j,...,21) bt 
(x;,.-.,21) interlaces the actions on X‘t! and X?. Taking the inverse 
limit of X* under these projection maps gives an action on the Cartesian 
product X%. Note that sequences in X° are infinite on the left, namely 
of the form (..., %j41, Uj,..-, 21). 

The compatibility between the actions on X* and X*+! precisely 
means that G acts by tree isometries on the rooted regular tree 7 with 
vertex set X*. 
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Note that, even if the wreath recursion @ is injective, the action of 
the G on the tree Y need not be faithful. This is, however, the case 
for the examples of self-similar action of the first Grigorchuk group (see 
Proposition 5.9) and of the congruence subgroup Ip(p). 

We introduced the self-similar structure on T'g(p) and not on SL2(Z) 
because the latter does not act on the rooted p-regular tree, in which 
each vertex has degree p+ 1 except the root, which has degree p. If 
we add an edge upwards from the root, and a rooted p-regular tree 
above it, to the rooted p-regular tree, we obtain a (p+1)-regular tree on 
which the action of T9(p) extends to an action of SL2(Z). In fact, this 
action is already well known, see [71, §II.1]: the (p+ 1)-regular tree is the 
Bruhat- Tits tree of SL2(Zy). Its vertices are homothety classes of lattices 
= 22 in Q,, and there is an edge between classes Q* A an Q A’ if they 
admit representatives aA,a’A’ with aA C a’A’ and [a’A’ : aA] = p. 
The group SL2(Q,) naturally acts on lattices, and SL2(Z,y) acts as the 
stabilizer of the root Q*Z?.. The congruence subgroup Ip(p) fixes an 
edge adjacent to the root, and the rooted p-regular tree Y is spanned 
by those lattices of the form ((p”, 0), (29 + 21p +--+: + tn_1p”+, 1)) for 
né€N and 1,...,%n_1 € {0,...,p — 1}. 

Let us remark in passing that obtaining an action on a rooted tree 
is not spectacular in itself: every countable residually-p group acts on a 
rooted p-regular tree. Indeed, choose a descending sequence G = Go > 
G1 > +++ of subgroups with [G : Gi] = p* and (1) G; = 1. Let the vertices 
of 7 be the set of right cosets of all G;, with an edge between Gg and 
Gi+1g for alli € N,g € G; and let G act by right multiplication on 7. 

This action is in general not self-similar, nor is it ‘economical’, in 
the sense that the permutation group acting on X * may have order 
comparable to (#.X)! rather than (#4.X!)#*". 

Finally, let us return to the construction of a self-similar group G as a 
quotient of a self-similar group F' so that the wreath recursion becomes 
injective. Knowing that the action of G is faithful helps in solving the 
word problem, in case G is not finite-state: 


Lemma 4.8. Let F be a self-similar group with wreath recursion @ and 
solvable word problem, and let G be the maximal quotient of F on which 
the induced wreath recursion is injective. Assume that the action of G 
on the tree J is faithful. Then G also has solvable word problem. 


Proof. Let S be a generating set for Ff’, and consider w € S*. We start 
two semi-algorithms in parallel; the first one will stop if w is non-trivial 
in G, and the second one will stop if w is trivial in G. 

If w is non-trivial in G, then it will act non-trivially on some vertex 
of 7, and this vertex may be found by enumerating all vertices of 7 
and computing the action of w on it by applying ¢. 

If w is trivial, then it belongs to one of the normal subgroups Nj. 
Going through all i = 0,1,... in sequence, and iterating i times @ on 
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w yields #X* elements of F. If all of them are trivial in F, then w is 
trivial; otherwise, continue with the next 7. 


Here are two fundamental examples of self-similar groups. Let 7 = X* 
be a rooted regular tree, and let Q be a group acting transitively on X. 
Consider the iterated wreath products Q; = Q?x Qvx-::?x Q, with 1 
factors; these groups act naturally on X* by the imprimitive action. 

On the one hand, there is a natural map Q;+1 — Q;, given by deleting 
the leftmost factor, i.e. naturally mapping Qi41 = Qirx Qj to Qi = 
lex Q;. Set then G = lim Q;, the projective limit being taken along 
these epimorphisms. The self-similarity structure ¢: G > Gx Q is 
induced by the identity map Q; —> Qj-12x Q. Because it ‘peels off’ 
the rightmost factor, it is compatible with the inverse limit. It defines a 
profinite self-similar group G. 

On the other hand, there is a natural map Qj; © Qj+1, given by 
inserting a trivial leftmost factor, i.e. naturally mapping Q; = 1l2y Q; 
to Qi41 = Qrx Q;. Set then L = lim Q;, the union (= injective limit) 
being taken along these monomorphisms. The self-similarity structure 
¢: L > Lx Q is induced by the identity map Q; —> Q;-12x Q. 
Because it ‘peels off’ the rightmost factor, it is compatible with the 
union. It defines a locally finite group L. 


Exercise 4.9. Show that L is a dense subgroup of G. 


A law for a group G is a word w(21, 22,...) in variables x1, 272,... such 
that, whenever the elements 21, %2,... are replaced by group elements 
from G, the word evaluates to 1 in G. For example, abelian groups are 
characterised as those groups satisfying the law w = [21, 2]. 


Exercise 4.10. Show that L satisfies no non-trivial law. 

Hint: it suffices to look at the case X = {1,2}. By Theorem 1.4, every 
finite 2-group imbeds in Q; for some 7 € N, and therefore in G. Finally, 
the free group is residually 2. 

See [1] for a general result about inexistence of group laws, which 
covers the group L. 


4.4. Similar Families of Groups 


The notion of a self-similar group may be generalised to a family of 
similar groups. 


Definition 4.11. Let Q bea set, and let 0: Q© bea map. A similar 
family of groups over Q is a family (Go)woea of groups and a family of 
homomorphisms 


ou: Go > Gow 2x, Sym(X), 


for a family of sets (Xw)wea: A 
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Just as cone each group G, acts on a tree % with vertex set 
Lixo X gi-lw X+++X Xp. This rooted tree is now not anymore regular, but 
it is still. iene; homogeneous, in that its isometry group is transitive 
on the set of vertices at a given distance from the root. 

As before, there are two fundamental examples of similar families of 
groups. Let (X)weg be a family of sets, and let (Q»)weq be a family of 
groups, with Q, acting on X,, say transitively for simplicity. 

Consider the iterated wreath products Qu,i = Qgi-1u%X, 42,17 °Xy Qo» 
with 7 factors; these groups act naturally on X,i-1, x --- x X» by the 
imprimitive action. 

On the one hand, there is a natural map Qo,i+1 — Qo,i, given by 
deleting the leftmost factor, i.e. naturally mapping Qw,i+1 = Qe iatX i 
Qui t0 Qo,i = 1 Ua. Qui. Set then Gy = lim Qu,i, the projective 
limit being taken along these epimorphisms. The self-similarity structure 


Oe ne Os 2x, Qw» is induced by the identity map Q.ji => 
Qow,i-12X,, Qw- Because it ‘peels off’ the rightmost factor, it is compatible 
with the inverse limit. It defines a profinite self-similar group Gy. 

On the other hand, there is a natural map Q.,; © Qw,i+1, given 
by inserting a trivial leftmost factor, i.e. naturally mapping Q,,; = 
Ley 51, Qo,i tO Qo,it1 = Qin X,i-1, Qo,i- Set then Ly = lim Qi; 
the injective limit being taken along these monomorphisms. The self- 
similarity structure dw: Ly > Lew?X,, Qu is induced by the identity map 


Qo yi = Qow,i-1?X, Qu. Because it ‘peels off’ the rightmost factor, it is 
compatible with the union. It defines a locally finite self-similar group L,. 


4.5. The Grigorchuk Family G, 


We shall concentrate particularly on one specific example. Write {0, 1, 2} 
for the three non-trivial homomorphisms C2 x Cp > C2, identified for 
definiteness as follows. We view the source C2 x Cp = {1,6,c,d} asa 
subgroup of the group F' given in (1.11), and the range Cy = {1, a} in 
that same group F’. The three homomorphisms are then uniquely defined 
by ker(O) = (6) and ker(1) = (c) and ker(2) = (d). Set then 


Q = {0,1,2}°, 0(W0W102...) = @1a2.... 


We start by the similar family (F),-q with maps ¢,: F > F 2? Sym(2) 
given by 


Po(a) = oe and for all z € {b,c,d}: (x) = feo) |e ; 


we define normal subgroups (Nj,i)ieNwen Of F by Noo = land Ny i41 = 
Oe (Ne ds and set Gy = F/Ujen Noi. This is the same construction 
as above, and computes G, as the maximal quotient of F’ such that 
the maps @o: F > F. Sym(2) descend to injective maps ¢y: Gw > 
Gow i Sym(2). 
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In particular, letting G denote the Grigorchuk group defined in §4.1, 
we have isomorphisms 


——— Go012)% =— G(120)% — G201)~ 
identifying the generating sets as follows: 


{a,b,c,d} {a,d,c,b} {a,b,d,c} {a,c,b, d}. 

The groups G, all act on the binary rooted tree, and it is easy to see 
that the orbit of the ray 1° is dense. Therefore, the groups G, could just 
as well have been defined by their actions on their respective orbit 1° Gy. 
These are naturally graphs, called Schreier graphs, with vertex set 1° Go, 
and with an edge from 1%g to 1% gs for each generator s € {a, b,c, d}, 
see Definition 1.8. 


Proposition 4.12. The graph 1° G, is isometric to the half-infinite 
line N with multiple edges and loops. Under this identification with N, 
the action of Gw is given by 


a(2j) = 27 +1, a(2j + 1) = 2), 
and for allz €{b,c,d}, x(0) =0, w(2"(2j + 1)) = 2(2a;(j) + DV. 


Proof. Consider the infinite dihedral group D = (a, z | a”, x”), and the 
wreath recursion 6: D> D2Sym(2) defined by 


o@= SX,  d@)= Ja |e. 


It defines a faithful action of D on the binary rooted tree Y with vertex 
set {0,1}*, and the action on the ray 1° is isomorphic to the action 
on the set of cosets (x)\D, since the stabilizer in D of 1° is (x). We 
abbreviate 1° =: I. The Schreier graph of the latter is a half-infinite 
line 
a x a x a x a x a x 

tice o— o— e o— o— o-— e o— eo. 

& 0 GG % %, % %,, z, @, “%,, %, 


Since the action of generators of D change only a single symbol on 
sequences in {0,1}, the identification of the Schreier graph’s vertices 
with N is explicit: it is the ‘Gray code’ [27] enumeration starting from 
the left-infinite word 1. Thus, the sequence ...11a;...21 is identified 
with the integer )°}_(1 — z;)2I-!, reading the number in base 2 with 
Os and 1s switched. 

Now, to obtain the Schreier graph of G,, one replaces each ‘z’ edge by 
a pair of edges labelled by two letters out of {b, c,d}, and puts loops at 
the extremities of the edge labelled by the remaining letter. The choice 
of which letter becomes a loop is determined by the position of the edge 
on the graph and the sequence w. 
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For example, here is the Schreier graph of the action of the first 
Grigorchuk group Go12 = (a, b,c, d) on 1° G: 


b dd c c did b b d dee CG ay de odin d 
b d b c b d b b 
cfob seb feb feb yeb yeh ef of yy 
t c b c d c b c c 
d 


(1.12) 


5. Growth Estimates for Self-similar Groups 


One of the purposes of this section is to reprove the following result. Let 
Q’ denote the subset of Q consisting of sequences containing infinitely 
many of each of the symbols 0, 1, 2. 


Theorem 5.1 (Grigorchuk, [30]). If @ € Q’, and more generally if w 
is not ultimately constant, then Gy has intermediate growth. 


We shall in fact prove much more, in preparation for the construction, 
in §6, of groups with prescribed growth. We mainly follow [5]. 


5.1. A Lower Bound via Algebras 


We begin by a general lower bound on growth, coming from the theory 
of Hopf algebras. Recall that the lower central series of a group G is 
defined by 71(G) = G and yn41(G) = [yn(G), G] for all n > 1. 


Theorem 5.2 (Grigorchuk, [31]). Let G be a finitely generated group, 
and assume that there is a subgroup H < G such that yn(A) 4 yn41(A) 
for alln EN. Then G’s growth function satisfies 


vo = exp(VR). 


In particular, if G is residually virtually nilpotent, then either G is 
virtually nilpotent (in which case yg is polynomial) or yg = exp(V/R). 


Before embarking on the proof, let us set up some algebraic notions. 
Let K be a field, and let G be a group. The group ring & = KG is the 
K-vector space with basis G, and multiplication extended linearly. It is a 
Hopf algebra: it admits a coproduct, which is an algebra homomorphism 
A: & > A&A defined on the basis G by gh g @g, a counit, which 
is an algebra homomorphism ¢: </ —-» K defined on the basis by g +> 1; 
and an antipode, which is an antihomomorphism 0: & > & defined on 
the basis by g +> g~!. Various axioms are satisfied; in particular the 
coproduct is coassociative: (1@ A)oA = (AQ@1)0A: & > (@)®, and 
cocommutative: A = 10 A, for t: o®? > of ®? the map r@ yh y@u 
flipping both factors. See [77] for details. 
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Denote by w the kernel of ¢, called the augmentation ideal. The 
associated graded of & is the vector space 


A= Se ja" 


Lemma 5.3. The associated graded & is a graded, cocommutative 
Hopf algebra. 

Proof. We have A(@w) < J ®wo+a ® &; so Aa) = <> {40's 
w”-*, Now given = € ool, choose x € w” representing it; 
write A(z) = > io yi @ wi with y,z% € w*; and set A(Z) = 
9% ® Mai where %,% € w'/w't! are the images of their 
respective representatives. It is easy to show that this definition does 
not depend on the choices of x, yj, z;; and it defines a coassociative and 
cocommutative coproduct as &’s coproduct was already coassociative 
and cocommutative. 


Let H be a Hopf algebra. An element x € H is called primitive if 
A(z) = £@1+1@z. The set of primitive elements in a Hopf algebra forms 
a Lie subalgebra of H for the usual bracket [x, y] = xy— yx. Conversely, if 
Lis a Lie algebra, then its universal enveloping algebra is a Hopf algebra 
whose primitive elements are L. Note that in characteristic p, one should 
consider restricted Lie algebras. 


Proposition 5.4 ([55, Theorem 6.11]). Let H be a cocommutative, 
primitively generated, graded Hopf algebra. Then it is the universal 
enveloping algebra of its primitive elements. 


Proof. Let P denote the Lie algebra of primitive elements in H. By the 
universal property of U(P), there is a map f : U(P) > H, which is 
graded, and surjective because P generates H. We show that f is also 
injective. Consider a homogeneous element z € U(P), say of degree n. 
If n = 0, then z ¢€ ker(f) if and only if c = 0. Assume then that f is 
injective on elements of degree < n. 

We have A(z) = 1®@2+2@1+4y for some y € U(P)en ® U(Ph)en. 
If f(z) = 0, then f(y) = 0; but we had assumed f to be injective on 
elements of degree < n, so y = O and x € P. By assumption, f is 
injective on P, so = O and therefore f is injective on elements of 
degree n as well. 


We apply these considerations to #7 = KG. HES we identify the 
primitive elements in KG. Let us define the series y} K@={9eG 
g —1¢€@"} of normal subgroups of G. 


Proposition 5.5. The space of primitive elements in KG is 


LEG = OOF @/rE1(G)) ®K. (1.13) 


n>1 
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Proof. The natural map gt g —1 from yX(G) to o"/a@"*"! extends 
to a Lie algebra isomorphism from yE(G) onto primitive elements of 
KG. 


In fact, yE(G) only depends on the characteristic p of K and, up to 
extension of scalars, ZS(G) depends only on p. Furthermore, it may 
be identified directly within G, and is a variant of the lower central 
series [40, 41]. Indeed one has y?(G) = G, and if p = 0 then 


Ve41(@) = (g € G|g' € [G, y(G)] for some t > 0), 
while if p > 0 then 
Vpus(G) = ([G, va (G)|{2? | & € ypnjpi(G)})- 


Proposition 5.6. Let S' be such that SUS~! generates G, and let UG,8 
denote the corresponding growth function. Let w(n) = dimg o"/w"t! 


denote the growth function of KG. Then 
vg,s(n) = w0) + w) +--+ w(n) for alln EN. 


Proof. Consider first an element z = 1—g € w, and write g = s;'--- ef 
as a product of generators and inverses. Using the identities 


1—-gh=(1-g)+(1-A)-(1-g)A— A), 
1—-g1=-(-g)+Q—-—9)0- 97, 


we get r= eG €;(1 — s;) modulo w?. 

The ideal w” is generated, qua ideal, by all « = (1 — g1)--- (1 — gn) 
with g; € G. By the above, o”/w"t! is generated, again qua ideal, 
by all « = (1 — 51)---(1 — sn) with s; € S. Now o"/o™*! has trivial 
multiplication, so the (1 — s1)--- (1 — sn) also generate o"/o"+! qua 
vector space. 

This generating set is contained in the linear span of Bg,g(n) C KG; 
so for 0 < i < n we have w(i) < dim(w’N KBg,5(n)) — dim@w** n 
KBg,s(m)) and the claim follows. 


Proof of Theorem 5.2. Consider a subgroup H such that yn(H) 4 
Yn+1(H) for all n € N. Without loss of generality, suppose H is 
finitely generated. For n € N, let P(n) be the set of prime numbers 
p such that (vn(H)/yn+1(4)) ® Fp € 0. Each P(n) is non-empty 
because Yn(H)/yn+i(H) is a finitely generated abelian group, and 
P(n+1) C P(n) because the commutator map (7n(A)/yn+1(4)) x H > 
Ynt1(A)/yn+2(H) is onto. There exists therefore a prime number p 
such that yn(H)/yn4i(7) ® Fp #€ 0 for all n € N. In particular, 
vi (H) # y2, (A) for alln eN, 
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Let K be a field of characteristic p. It follows that KH contains a 
primitive element x, of degree n for all n € N, namely ty = gp, — 1 for 
some gn, € vi (H ve 41 (4); so K4 contains m(n) linearly independent 
elements of degree n, where z(n) denotes the partition function. Indeed 
to every partition n = 11+---+%, with 1 < --- < 7% associate the element 
Xi, +++ Li,; these elements are linearly independent by Proposition 5.4. 
Therefore, the growth function of KH is at least m(n). 

It now follows from Proposition 5.6 that vy,g(R) = a(R) holds for 
any generating set S of H. Classical results on partitions [37] tell us that 
m(n) «x exp(./n); so a fortiori vg(R) = exp(VR). 


Note that, for Grigorchuk’s group G = Goj2, the quotients yn(G)/ 
Yn+1(G) have bounded rank, in fact 1 or 2, see [67]; so that no 
improvement on the lower bound can be obtained using Proposition 5.6. 


5.2. Metrics on Gy 


We already saw that the groups G, are contracting, namely if d(g) = 
(go, 91))% then gg and g; are shorter than g. We shall need a strength- 
ening of this property: we assign norms ||- || to the groups G, to obtain 
relations of the form 


2 
Iollow + Ilgillow < ll alle + || allo) (1.14) 


(a) 


with n» > 2 as large as possible. 
We do this by assigning norms € R, to the generators a, b,c, d of Gy, 
and extend || - ||~ to G, by the triangular inequality: 
Ilgllo = min{|| si] +--+ + [lsnll : 9 = $1--+ Sn, $1 € SH. 


For this purpose, consider the open 2-simplex 
3 1 
A= (B,y,5) €R smads.y,8)< 5 Bry teat : 
Its extreme points are (3. 53 0) and its permutations. A choice of py = 


(6, y,5) € A defines the following norm on the generators of Gy: 


lalla = 1 — 2max{B, y, 5}, 
blo =B— |lallo, lello = — llalle, Ildllo = 6 — |lalle- 


In particular, note that the triangular inequality ||z|lo + Ilyllo < Ilzylle 
holds for x, y € {b, c,d}, and is sharp for z = c, y = d. 
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The simplex A, and a 
3-cycle preserved by 


M2M1Mo 


(1.40) 


We extend the family of groups (Gy), <0,1,2;" to a family 
(Go) oe{0,1,2}2) Namely, the parameter space is now Q = {0, 1,2)” with 
the two-sided shift map 0: Q©. The group G, itself only depends on 
the restriction of w to N, but the norm on G, depends on the restriction 
of w to —N. 

Analogously to before, we denote by Q’ the subset of Q consisting 
of sequences that contain infinitely many 0,1,2 in both directions. For 
ow € Q', we construct the point py € A as follows. Consider the matrices 


it aA 2 0 0 2 0H 
Mo={0 2 0), M={111), Me=]0 20 
Oi: 2 00 2 oe oe 

Define then 
n: Ax {0,1,2}> (2,3), My: AS, pw: A> (0,9) 


by setting, for all 4 € {0,1,2} and all pe A, 


n(p,A) = (111): My(p)_ the ¢'-norm of M;(p), 


the projection of M,(p) to A, 
uU(p) = min{B, y, 5} the minimal distance of p to a vertex of A. 


Endow A with the Hilbert metric da(V1, V2) = log(V1, V2; V_, V+), 
computed using the cross-ratio of the points V,, V2 and the intersec- 
tions V_, Vz of the line containing V1, V2 with the boundary of A. 
The transformations M, are projective transformations of A, and are 
therefore contracting: 


Lemma 5.7 (Essentially [11]). Let K be a convex subset of affine space, 
and let A: K © be a projective map. Then A contracts the Hilbert metric. 
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If furthermore, A(K) contains no lines from K (that is, ACK) NL #4 
K€ for every line € intersecting K), then A is strictly contracting. 


Proof. Because A is projective, it preserves the cross-ratio on lines, 
so we have da(K)(A(V1), A(V2)) = dx(V1, V2) for all Vi, V2 € K. 
Furthermore, on the line @ through Vj, V2, the intersection points 
£0 0A(K) are not further from Vj, Vo than {V,, V_-} = €NdK; the 
Hilbert metric decreases as V+ are moved further apart from V1, V2, and 
this gives strict contraction under the condition A(K)N@ AK Ne. 


We are ready to define the points p, € A, and therefore the metrics 
| - I». Choose an arbitrary point p € A, and set 


Dy = lim Mit ) iM xe O+++0 Mis. (p). (1.15) 


N— CO 


Note that because the transformations M), are contracting, the limit p, 
is independent of the choice of p. Note also that, by our assumption that 
the negative part of w contains infinitely many 0, 1 and 2, the limit p, 
does not belong to the boundary of A. From now on, we write 


Nw = 1(Pw, @0)- 


Lemma 5.8. For all g € Gy with o(g) = (90,91), we have the 
inequality (1.14) 


2 
IIGollow + Ilgillow S all alle + Ilalle)s 
(7) 


and furthermore, if g € {b, c,d} then up to replacing g with a conjugate 
we have 


2 
lo llow + 91 low < —Ilgllo- 
No 
Proof. Without loss of generality, we suppose w9 = 0, and write py = 
(B,y,4) and pow = May Po = (f', y’, 5’). We have 


6,2y,26 1,2y,26 
(RIH ee, Cys 2a Sa 


Nw Nw 
Thus, 
Il Blow + oa) low = [lOllow = B' — IIallow = B’ — (1 — 28") = 38-1 
3 2 
= ee = —|ladllo, 
No No Nw 
2y 
IIcllow + loo(@) low = — = —llacllo, 
U] Nw 
r) 2 


2 
IIdllow + leo(@)llow = — = —Hlad]lo. 
u ra) 


LAURENT BARTHOLDI 44 


Now given g € G., write it as a word of minimal norm as g = a2, --- 
axe a’, with 2; € {b, c, d}, and the a’ mean that the initial and final “a” 
may be present or absent. Thus, ||gllo > |laz1llo +--- + llazelle — llalle- 
On the other hand, each “a” in the expression of g contributes nothing 
to go and gi, while each “z;” contributes an “z;” and a “wo(a;)” to go 
and gi, in some order. Summing together the inequalities above gives 
the claimed (1.14). 

The second claim follows, because the extra “||a||,” term occurs only 
if g both starts and ends with a letter in {b, c, d}, and this case can be 
prevented by conjugating g by its last letter. 


Lemma 5.8 can be used to prove statements on G, by induction. For 
example, 


Proposition 5.9. [fw € ’, then the action of Gy on the tree. 7 = X* 
is faithful, and it is transitive on each orbit X*. In particular, G, is 
infinite. 


Proof. We first use induction on || - ||~, simultaneously on all w € ’, 
to show that if g € G, acts trivially on 7, then g = 1. 

The induction starts by noting that the generators b,c,d act non- 
trivially by our assumption that @ contains infinitely many 0, 1,2. 
Indeed, without loss of generality consider b; let k € N be minimal such 
that w;,(b) £1; then b acts non-trivially on X**!, 

Consider then g € G, acting trivially on 7. In particular, g fixes X, 
so ¢o(g) = (90, 91)). By Lemma 5.8, both go and gj are shorter, so by 
induction they are trivial; thus g = 1 because ¢, is injective. 

To check that G, acts transitively on X a it suffices to show that the 
stabilizer H of 1 acts transitively on X*~!1; because then X*-!1G, = 
X*-l1(a) = X*. Now H contains b,c, d, y® for a letter y € {b, c, d} such 
that w(y) = a; and the action of 6, c,d, y® on w1 is w61, wel, wd1, wal 
respectively, so that the H-orbit of w1 is W1 for a Ggw-orbit W C Gua 
Again we are done by induction. 


Note that the action of G, is still faithful if @ only contains infinitely 
many of two symbols; it is not faithful if @ contains finitely many of two 
symbols. 


5.3. The G, Are Infinite Torsion Groups 


One of Burnside’s questions [13] asks whether there are infinite, finitely 
generated groups in which every element has finite order. The first such 
examples were constructed by Golod [26]; here, we show that the groups 
Gw are other examples: 


Theorem 5.10. Jfwe QQ’, then G, is an infinite torsion 2-group. 
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Proof. The group Gj is infinite by Proposition 5.9. We prove the claim 
by induction on ||g||. It is easy to check that the generators a, b,c, d 
all have order 2. Consider g € G,, with ¢,(g) = (90, 9))7. Ifa = 0, 
then go,g1 are shorter than g by Lemma 5.8 so have finite order, say 
2 ,2™ respectively. Then g has order 2™@*{"0-"}, Tf 7 = (0,1), then 
g° = 9091, 9190), and gog, is shorter than g again by Lemma 5.8, so 
has finite order, say 2”. Then g has order 2"+!. 


Note that if w contains finitely many copies of a symbol, then G, 
is not a torsion group anymore. In fact, suppose that w contains no 0; 
then w;(b) = a for all i, and the element ad has infinite order, since 
eo ((ab)2”) = (((ba)”, (ab)”)) for all n € Z. 

We also note that the groups G, closely resemble infinitely iterated 
wreath products; namely the map ¢y: Gy > Gow ?Sym(2) is almost an 
isomorphism: 


Definition 5.11. Let (G,)weg be a similar sequence of groups. It is 
called branched if every Gy has a finite-index subgroup K, such that 
Ke Sib (Ka): 


ow — 


If the subgroups K, are merely required to be non-trivial, then (G,) is 
called weakly branched. 

Every group in a (weakly) branched family of groups is also called 
(weakly) branched. A 


Proposition 5.12. The groups Gy are branched for all w € Q’. 


Proof. For each w € Q’, let 2% € {b, c, d} be such that w(2%») = 1, and 
set K, = (aw, a])%*. Choose also y, € {b, c, d} \ {aw}. Then 1 x Kou is 
normally generated by (1, [2e«, a])), and 


«1, [Sees a|)) = Po ([Lo, Yo) = bo ([Lo, al|%e, al Yo) bo Kw); 


the same computation holds for Kg x 1. 

We now show that the groups K, have a finite index. Consider first 
the quotient G,/(2) ©. This group is generated by two involutions a 
and y,, so is a finite dihedral group, because G, is torsion. It follows 
that (a) has finite index in G,. Then (2) & /Ky = (a) Ky has order 
2, so K, also has a finite index. 


Proposition 5.13. If G is a p-torsion weakly branched group, then it 
contains L = |e’ Cy as a subgroup. 


Sketch of Proof. Let (Go)weq be a similar family of groups, with G = 
G», and let Ky < Gy be the subgroups given by the condition that 
(Go)weg is branched. For each w € Q, let gq, € Ky be an element of order 
p, and let n(w) € N be such that g, acts non-trivially on Xjn()-1,, X 
+++ X& Xy; let Uy € Xgnw)-14 X ++: X Xp be a point on a non-trivial orbit. 
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Define then simultaneously and recursively Ly = (Lenw@)y4@vo; Go). It 
contains an element g, permuting p copies of Ljnw),,, 80 is isomorphic 
to L. 


If G contains no torsion, or torsion of different primes, analogous (but 
harder-to-state) results hold. In particular, by Exercise 4.10, branched 
groups satisfy no law. This recovers a result by Abért [1]. 


5.4. Lower Growth Estimates for G, 


The proof of Theorem 5.1 requires upper and lower bounds on the growth 
function of G,». A lower bound is easily provided by Theorem 5.2: the 
group G,, is infinite (Proposition 5.9) and torsion (Theorem 5.10), so 
cannot be virtually nilpotent, since nilpotent groups with finite-order 
generators are finite. However, in some cases a direct argument also 
gives the bound vg, (R) = exp(V/R). 

Define indeed maps 6: F > F by 


O(a) = aya if wo = +--+ = @p_-1 Fog, and wo(y) = a, wK(y) = 1, 
6,(c) =< for all x € {b,c, d}. 
f (1.16) 
A direct calculation shows that 0, induces a map 04: Gow > Gy, with 


bw Oa(g)) = |e |g 


for some * € (a, y) using the notation introduced in (1.16). Furthermore, 
(a, y) is a dihedral group of order 2*+2, 


Exercise 5.14. Prove that ®, is a group homomorphism. 


Let us now cheat, and assume that the element * is always trivial, 
rather than an element of a finite group of order 2"+?. If k is bounded, 
this is unimportant; however, if k is unbounded then an additional 
argument is really required. 

Denote by B,(R) the ball of radius R in G,, and abbreviate 
Uo(R) = vq,(R). Consider the map G?2,, > Gy given by (go, 91) 
64(90)" « On(g1). For Ro, Ri even, it defines an injective map Bow(Ro) x 
Bowo(R1) > Bo(2(Ro+ R1)), hence Usw(R)” < v»(4R) for all R even. We 
conclude (2+ 4") > Upky(2)2” > 52", so v(R) = exp(4 log 5VR). 


5.5. Upper Growth Estimates for G, 


We are ready to give an upper bound on the growth of G,. As before, 
we abbreviate u»(R) = vg,(R) for the growth function of G,. We start 
by a 


Lemma 5.15. For every A > 0, there is B € N such that, for all 
ow € QQ’, we have (Auw) < B. 
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Proof. It suffices to bound the number of elements of G, whose minimal 
expression has length < Aj, and has the form g = az) --- axe for some 
x; € {b, c, d}, since there are at most eight times more elements of norm < 
Apy, namely all {1, 6, c, d}g{1, a}. Now by definition g has norm at least 
£1», and there are at most 3° such gs, so one may take B = 8 - oe: 


Lemma 5.16. For all w € Q we have notes = ho + |lallo- 

Proof. Write py = (8, y,5), and assume without loss of generality wo = 
0 so Ucw = 2/N» min{y,5}. Consider the six orderings B < y <6 etc. in 
turn to check the inequalities 


Nokow = 2min{y, 6} > min{B, y,d}+(—max{B, y,5}) = do +(I4llo- 


Lemma 5.17. Let f be a positive sublinear function, namely f(n)/ 
n—>0O0asn— oo. Then f is bounded from above by a concave sublinear 
function. 


Proof. For every 6 € (0,1), let ng be such that f(n) — @n is maximal. 
Given n € R, let ¢ < 6 be such that n € [ng, n¢] with maximal ¢ and 
minimal @, and define f(n) on [ne, nz] by linear interpolation between 
(no, f(ne)) and (ne, f(nz)). Clearly f >f, and f(n)/n is decreasing and 
coincides infinitely often with f(n)/n, so it converges to 0. 


Proposition 5.18. There is an absolute constant B such that, for all 
wo€Q andallk EN, 


ok 
Vo(No 4 Nok-loloke) Ss sae 


Proof. We cheat, in assuming that the functions v,(R) are log-concave, 
ie. satisfy Up (Ro) ve(R1) < vw((Ro + R)/2)?. This assumption is in 
fact harmless, since each function uv, may be replaced by its log-concave 
majorand: the smallest log-concave function that is pointwise larger than 
Ue, given by Lemma 5.17. For details, see [5, §3.1]. 

For all 7 EN, set aj(R) = 18(R + 2)0,1,(R + Ueig)- 

The proposition will follow once we have proven the inequalities 
a;,(R) < ai41(R/Ngig)” for all i, R; because then 


Vo (Nw ioe Ngk-lyleoky) < a0 (Nw 2% Ngk-lylegky) 
2 
< a1 (Now oa Nok-lolgky) 


Se og (LK) < Be by Lemma 5.15. 


To simplify notation, we consider only the case 7 = 0, as all cases are 
the same. We have 


LAURENT BARTHOLDI 48 
ag(R) = 18(R + 2)uy(R + Mo) 
<18(R+2) >) 2tee(Ro)%ow(R1) 
Rot Ris 2 (R+potllalle) 


<36(R+2)? max — tgw(Ro)ew(R1) 
Ro+ Ris (R+to+llalle) 


< 67(3/nw)?(R + 2)7 teo((R + Mo + || allo)/Neo)? 


< (18(R + 2)/nw Yow(R/ No + How)” 
< a1 (R/n)?. 


We are now ready to conclude the proof of Theorem 5.1, by showing 
that the groups G, have subexponential growth. There are in fact 
different methods for this. Let 4, be the exponential growth rate of 
Gy, as in (1.5); we are to show Ay = 1. 

Let us first assume that @ contains infinitely many 0,1,2, so there 
are infinitely many positions k € N with w, = 0 and with 1 and 2 in 
between these positions. For these k, the point p,x,, € A belongs to 
the subsimplex {68 < y AB < 46}. Thus, 6B < 3 on that subsimplex, 
Noky = 3— 26 > 7/3 is uniformly bounded away from 2, and y,é > , 


SO [lgk+14 > qg- Thus, by Proposition 5.18 


log U%(R) 2* log B 


log Aw = lim = < lim inf =0 


No? Nok-lobokeo 


since on a subsequence the jx, are bounded away from 0, all terms 
2/Noig are bounded by 1, and infinitely many of them are bounded by 
6/7. If w is not ultimately periodic, then u,«,, > 0, but the convergence 
is slow enough (actually, at worst linear) that the same argument applies. 

A more “abstract” proof may be obtained by noting that the map 
@ + A» is continuous and bounded by 3, and that the proof of 
Proposition 5.18 gives loga,y < 2/nwlogAcw. Because the action of o 
on ’ is ergodic, we must have logA, = 0 for all w € Q’. 

Let us compute more precisely an upper bound for the growth of the 
first Grigorchuk group G(j2)~. Given that the sequence w = (012) 
is 3-periodic, we can find p, € A explicitly. The calculation is made 
even simpler by noting that po and p,2,, are cyclic permutations of py; 


: : . O01 ‘“ 
thus py is the normalised eigenvector of Mo - ( : 9 0). and its spectral 


radius is n+ * 2.46, the positive root of the characteristic polynomial 
T3_T2-2T-4. Thus, for the first Grigorchuk group we get Vn Nk per) < 


B2" for all k, and therefore 


Up (R) < exp(RI°8 2/ log n+) 3X exp( RO"), 
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6. Growth of Permutational Wreath Products 


The upper and lower bounds on the growth of G, are both of inter- 
mediate type exp(R%), but do not match. We consider, in this section, 
permutational wreath products based on the groups Gy. 

Choose a sequence w € Q' and a ray & € {0,1}%, and consider the 
ray’s orbit X = &G,. Choose a group H. Set then 


W.(H) = Hy Go. 


(Even though the notation does not make it clear, the group W,.(H) 
depends on &.) We shall show, in this section: 


Theorem 6.1. Jf H has subexponential growth, then so does W,(H). 


Theorem 6.2. Let ny * 2.46 be the positive root of T? — T? —2T —4. 
Let f: Ry > Ry be a function satisfying 


fQR) < f(R)? < f(n_R) for all R large enough. (1.17) 
Then there exists w € Q! such that vw, cy) ~ f. 


We will give more illustrations of the growth functions that may occur 
in 86.3. We content ourselves with the following: 


Theorem 6.3. For any finite group H, the group Wo12(H) has growth 
UWo1o(H) ~ exp( RI08?/lo8 n+), 


The proofs of Theorems 6.1 and 6.2 rely on estimates of the support 
c X of an element of W,.(H) of norm < R. Recall that every element 
of W.(H) may be written in the form cg with c: X — H and g € Gy; 
its support is {c € X : c(x) # 1}. To better understand the support of 
elements of W,,(4), let us introduce the following: 


Definition 6.4. Let G be a group acting on the right on a set X with 
basepoint €. For a word w = w,... we € G*, its inverted orbit is the set 


O(w) = {wir + we: OS 7 < Gh. 


If, furthermore, G is given with a metric || - ||, then its inverted orbit 
growth is the function A: R; > N given by 


A(R) = max{#O(w) : ||wl] < R}. A 


We write O,(w) and A,(R) in the case of G = Gy, with its metric 
| - lo. Thus, for example, taking € = 1°, the inverted orbit of acadab is 


O(acadab) = {€ acadab, € cadab, € adab, & dab, € ab, &b} = {1010, 1, 100} 


see the Schreier graph at the end of §4. 
Note that a basepoint & is implicit in the definitions; yet, 
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Exercise 6.5. Assume that G acts transitively on X. Show that A(R) 
depends only mildly of the choice of &, in the following sense: if €,&’ € X 
are two choices of a basepoint and A(R), A’(R) are the corresponding 
inverted orbit growth functions, then there exists a constant C' € R such 
that A(R) < A’(R+ C) and A’(R) < A(R+ C) for all R. 


Proposition 6.6. There is a universal constant C such that, for all 
aE andallk EN, 


OP Ns (ys ea dices) SC 2% 


Proof. For the upper bound, we note that Lemma 5.8 applies just as 
well to the group G, as to the monoid 


R:= S*/{b? = c? = d*? =1, bc = d, cd = b, db = ch, 


see Equation (1.11). Indeed, in a minimal-length representative of an 
element of R, the number of “a” is at least the number of b, c, d-letters 
minus one, and this is the only property required for Lemma 5.8. Now 
given a word w € S%, its inverted orbit may be read from the image of w 
in R. Every element of R has a unique reduced form: the reduced form of 
a word w € S* is the word w € S* obtained by replacing every subword 
equal to a left-hand side of a relation by the corresponding right-hand 
side. 

Without loss of generality and merely at the cost of increasing the 
constant C’, we may suppose § = 1%. We claim that the inverted orbit 
of a word w € S* coincides with the inverted orbit of its reduction w. To 
see this, consider w = w,... we € S*, a subword w;wj+1 equal to a left- 
hand side of a relation, and the word w’ obtained by replacing w; wj+1 
by the right-hand side of the relation. All terms €wj41... we with 1 4 j 
clearly appear both in O(w) and O(w’). For the remaining term in O(w), 
we have €wj41... we = Ewj+2... we because wj+1 fixes &, so this term 
also belongs to O(w’). 

If w € F satisfies ¢,(w) = (wo, wi))a and —€ = é’0, then 


Oo(w) © Oow(wo)0U Ooe(w1)1, 


where the inverted orbits Og, are computed with respect to the 
basepoint &’; and similarly if § = &’1. We therefore get 


< 
Ao(R) < Rot Ri <2/n,(R+Hlalle) (Aca(Ro) ot Aow(R1)). 
The same argument as in Proposition 5.18 finishes the proof of the upper 
bound. 
For the lower bound, it suffices to exhibit for all k €¢ N a word of length 
at most No - ++ Ngk-lwlgky and inverted orbit of size at least 2". For that 
purpose, define self-substitutions ¢, of {ab, ac, ad}*, for x € {0,1, 2}, by 
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fo: abt > adabac, act acac, ad +> adad, 
(1: abr abab, act» abacad, ad adad, 
2: abr abab, ach acac, ad +> acadab, 


and note that for any word w € {ab, ac, ad}* representing an element of 
F we have 


awa 
Z w if f4)(w) contains an even number of “a”, 
Bes(Guay(W)) = 4G? 
if 4 )(w) contains an odd number of “a”. 
W Ww 


In particular, ¢,, induces a homomorphism Gogg > Gy. 
By induction, we see that for any non-trivial w € {ab, ac, ad}* (repre- 
senting an element of G,x,,) we have 


Nobo +++ Sap (W)) = oe 


Note then that, if Z € N° count the numbers of ab, ac, ad respectively 
in w, then MtZ counts the numbers of ab, ac, ad respectively in C;(w). 
Indeed without loss of generality consider + = 0; then every ab in w 
contributes one each of ab, ac, ad to €;(w), while every ac and ad in w 
contributes two copies of itself to ¢7(w). 

Let as € {ab, ac, ad} be such that ||as||,«,, is minimal — recall the 
notation p, from (1.15): if pox, = (8, y,6) and B < y,6 then s = b, etc. 
Let W be the basis vector in R® with a “1” at the position that as has in 
{ab, ac, ad}: if B < y,d then W = (1,0,0)*, etc. Set w = bao + * Sap (8): 
We have A,,(w) > Qk and 


lwile = Zt = W* Mon. we Moy Po 


= No? Nek-ty W' Dyk = No Ngk-lpbloke 


Finally, let us introduce the “choice of inverted orbits growth” 
function, first generally for a group G, with given metric || - ||, acting 
on a set X with basepoint &: 


X(R) = F{O(w) : |lwl| < R}. 


This function counts the number of subsets that may occur as an inverted 
orbit of a word of length at most R. Because O(w) is a subset of 
cardinality at most R +1 of the Schreier graph X, and furthermore 
lies in the ball of radius R about € in X, we get the crude estimate 
x(R) < ( ae yr) based on the growth function vx ¢ of balls centered 
at € in the graph X. However, in the particular case of the groups Gy, 
we can do better: 


Proposition 6.7. There is an absolute constant D such that for all 
w € Q' and allk EN we have 


k 
Lo (Ne aid Nok-loloky) < D? : 
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Proof. Consider w € G% with $o(w) = (wo, w1)). The inverted orbit of 
w is determined by the inverted orbits of wo and w1, two words of total 
ow-length at most 2/n_(||wlle + lalla) by Lemma 5.8. Therefore, 


LotR) < a Dow( Ro) Zow(R1); 
Rot Ris > (R+llalle) 


and the same argument as in Proposition 5.18 applies. 


6.1. The Growth of W,(H) 


We start by general estimates on the growth of a permutational wreath 
product: 


Proposition 6.8. Let H be a group with growth function vy, and 
suppose that uy is log-concave. 

Let G be a group acting transitively on a set X with basepoint €, and 
let vg denote the growth function of G. Denote the inverted orbit growth 
of G on (X,&) by A, and denote its inverted orbit choice growth by X. 

Consider the wreath product W = H 2x G, generated by SU T@é for 
the generating sets S,T of G,H respectively. Then 


vq(R)uy(R/A(R))A® < vy (BR), 
vw(R) <vg(R)vy(R/A(R))A® (2R)A F(R), 
vw(R) <ug(R)\(#H)S™ Z(R) if H is finite. 


Note that the assumption that vy be log-concave is mild, owing to 
Lemma 5.17. 


Proof. We begin by the lower bound. For every R € N, consider a 
word w € G* of norm < R realizing the maximum A(R); write O(w) = 
{z1,..., 2%} for k = A(R). Choose then & elements aj,...,a, of norm 
< R/k in A. Define f € Oy H by f(a) = aj, all unspecified values 
being 1. Then wf € W may be expressed as a word of norm R+|a,|+- +--+ 
|ax| < 2R in the standard generators of W, by inserting a1@é,..., a,@é 
appropriately into the word w. 

Furthermore, different choices of a; yield different elements of W. 
Finally multiplying wf with an arbitrary g € G of length at most R, we 
obtain vg(R)uy (R/k)* elements in the ball of radius 3R in W. 

For the upper bound, consider a word w of norm R in W, and let 
f € )oy A denote its value in the base of the wreath product. The 
support of f has cardinality at most A(R), and may take at most (R) 
values. 

Write then sup(f) = {21,...,2%} for some k < A(R), and let 
aj,..., a, € H be the values of w at its support; write €; = ||a;|]. 

Since )> €; < R, the norms of the different elements on the support of 


f define a composition of a number not greater than R into at most k 
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summands; such a composition is determined by k “marked positions” 
among R+k, so there are at most Ce =) possibilities, which we bound 
crudely by (2R)*. Each of the a; is then chosen among vy (;) elements, 
and (by the assumption that vy is log-concave) there are [| vy(€:) < 
UH (R/k)* total choices for the elements in H. 

We have now decomposed w into data that specify it uniquely, and we 
multiply the different possibilities for each of the pieces of data. Counting 
the possibilities for the value of w in G, the possibilities for its support 
in X, and the possibilities for the elements in H on its support, we get 


vw(R) 3 vg(R)va(R/k)* (2R)* X(R), 
which is maximised by k = A(R). 


Finally, if H is finite, then we may more simply bound the possible 
values of f by (#4H)*. 


Corollary 6.9. Let G,H be groups of subexponential growth. Let G 
act transitively on a set X with basepoint —, with sublinear inverted orbit 
growth and subexponential inverted orbit choice growth. Then the wreath 
product W = Hx G has subexponential growth. 


Proof of Theorem 6.1. Assume that H has subexponential growth; 
then by Lemma 5.17 there exists a log-concave subexponentially growing 
function vy bounding the growth of H from above. 

By Propositions 6.6 and 6.6, the function A is sublinear and the 
function & is subexponential. By Proposition 5.18, the growth of Gy 
is subexponential. Corollary 6.9 then shows that W,,(H) has subexpo- 
nential growth. 


In the special case of H finite and G = Gy, Proposition 6.8 gives the 


Corollary 6.10. Let H be a non-trivial finite group. There are then two 
absolute constants F, E > 1 such that the growth function v of W,(H) = 
Aix Gy satisfies 


k k 
E? < U(Ne ee Nok-lobkokw) < Re? id 


Proof. Take together the upper bound on the growth of G,, from Propo- 
sition 5.18, the bounds on the inverted orbit growth from Proposition 
6.6, and the choices for the inverted orbits from Proposition 6.7. The 
conclusion follows from Proposition 6.8. 


Proof of Theorem 6.8. This follows directly from Corollary 6.10, using 
the fact that n,i, = n+ for allie N. 


6.2. Proof of Theorem 6.2 


Our approach will be to construct, out of the function f satisfying (1.17), 
a sequence w € {2,012} with long stretches of 2 when f grows fast, and 
long stretches of 012 when f grows slowly. 


LAURENT BARTHOLDI 54 


We start by introducing some shorthand notation. For a finite sequence 
@=M0..-.@n—1 € {0,1,2}” and p € A, we write by extension 


Me = Mey -** Men AD 
and 
n(p,@0 ---@n—1) = 1(p, 0) Map, 1) +++ Mag...on_2 Ps On—1)- 


For w € {0,1,2}4, recall the construction of p, € A from (1.15), and 
No = N(Pw,@0) and [La = (py). Assume that a sequence w € {0, 1, 2}4 
is under construction, and that there exists k € N such that w; has been 
determined for all i < k. Then py, ny and fy are determined, and so 
ale Deigs Noiw: Moi for all 1 < k. We abbreviate 


Pi = Potw) Ni = Notws Mota: 
Lemma 6.11. [If the restriction to N of the sequence w has the form 
@ = (012) 291 (012)"2 292 (012) 293 |. , (1.18) 
with 11, 91, 12,92,°:: = 1, then the wy are all bounded away from 0. 


Proof. In fact, the image of Mo12 is the open triangle spanned by 
(3, 5.4): (2, 2,2) and Gaeta) so after each 012 the wz belongs 
a J 
to (77; x). _ 
The image of that triangle under My: is contained in the convex 


quadrilateral spanned by (3, 3,3); (G7 17+ 17)» (qo 4+ 9) and (5, 7p, 5), 
So [4g € (%, §) for all k. 


The heart of the argument is the following lemma, which shows that 
7 approaches very quickly its limiting values 2 and n+ as the sequence 
contains long segments of 2 or of 012: 


Lemma 6.12. There exist constants A’ <1, B’ > 1 such that, 
1. For allp€ A and allneN, 
n(p, (012)") = ny" A’; 
2. Forallpe A andallneéeN, 
n(p, 2”) < 2"B". 


Proof. Let U denote the image of Moji2, and let py € U denote the 
fixed point of Mo12. Note first that n(—, 012) is differentiable at p,, and 
that Mo12 is uniformly contracting on U; let p < 1 be such that Moi2 
is p-Lipschitz on U, and let D be an upper bound for the derivative of 
log n(—, 012) on U. Recall that n? = n(p+,012). 
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For all k € N, write py = M gy) (p). For k > 1 we have ||py — pyll < 


p*-1, so |logn (pp, 012) — 3logny| < Dp’! 


| log n(p, 012) — 3logn+| < 3log3. Therefore, 


, while for k = 0 we write 


n-1 
| log n(p, (012)") — 3n log ny| < 3log3 + > | log n(pp, 012) — 3log n+| 
k=] 
< 3log3+ D/(1 — p) 


is bounded over all n and p. The estimate (6.12) follows, with A’ = 
3° exp(D/(1 — p)). _ 

For the second part, consider py = Mox(p), and note that p, converges 
at exponential (Euclidean) speed to a point poo on the side {6 = 5} of 
JA, since (9M2/95)(*,*, 4) = 4; so we have ||\p_ — Pooll < p*—! for 
some p < 1. As above, n(—, 2) is differentiable in a neighbourhood U of 
{6 = ah; and the derivative of log7(—,2) is bounded on U, say by D. 
Recall that n(p,2) = 2 for all p € dA. The same computation as above 
yields (6.12), with B’ = 3exp(D/(1 — p)). 


We now reformulate the statement of Theorem 6.2 as follows. Set 
g(R) = log f(R), so that we have 


g2R) < 2g(R) < (n+) (1.19) 


for all R large enough. For simplicity (since growth is only an asymptotic 
property) we assume that (1.19) holds for all R. Without loss of 
generality (since we are allowed to replace g by an equivalent function), 
we also assume that g is increasing and satisfies g(1) = 1. 

We are ready to construct w. Fix arbitrarily the value of w on its 
negative part, say wn = (012)~°. This determines an initial metric 
po € A. Out of the function g, we will construct a sequence w such that, 
for constants A, B, we have 


g(n(po, @0 - -- @k-1)) 
9k 

in fact, it will suffice to obtain this inequality for a set of values ko, k1,... 

of & such that sup; (kj41 — kj) < oo. Indeed, the orbit p; of po in A’ will 

remain bounded, so we will have u(p;) € [C,1] for some C > 0. By 

Corollary 6.10, 


A< < B for all k; (1.20) 


E®* < v(n(po,@0...@p-1) ite) < v(Bg!(2*)), 
v(ACg71(2*)) < u(n(po, 00. -@p—1) eR) < F” 
and therefore v(R) ~ exp(g(R)) = f(R). 


Proof of (1.20). We will extend the sequence w to be, on the positive 
integers, of the form (1.18), 


@ = (012) 29! (012). 272 (012) 293 |, 
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with 71,71, 22,J2,--: => 1. The wz are bounded away from 0 by Lemma 
6.11. 

Assuming by induction that w’ = wo9...@,—1 has been constructed, 
we repeat the following: 


e while g(n(p0,@’)) < 2", we append 012 to a’; 
e while g(n(po,@’)) > 2", we append 2 to a’. 


For our induction hypothesis, we assume that the stronger condition 


1 
52" < g(n(p0,0")) = 2 
holds for each & of the form 7 + 7) +---+%m+ jm, and that 
2" < g(n(po,0')) < 3°2" 
holds for each k of the form 7% + 71 +---+ %m; these conditions apply 
whenever w is a product of “syllables” (012)" and 2/¢. 
Consider first the case 52k < g(n(p0, @’)) < 2. and let n be minimal 
such that g(n(po,@'(012)”")) > gk+3n_ Then, for all i € {1,...,n}, 


Lemma 6.12(6.12) gives n(po,@'(012)") > n(po,@’)n3* A’. Let u € N be 
minimal such that A’ > n,“; this, combined with g(nR) > 2g(R), gives 


9(n(po; 0" (012)*)) = g(n(po, oni“) = a waht", 


By minimality of n, we have g(n(po,@'(012)"—!)) < 2**+8(™—-D, since g 
is sublinear and 7 < 3, we get 


g(n(po, @ (012)")) < 332k+3". 


Consider then the case 2 < g(n(po,o’)) < 3°2", which is similar, 
and let n be minimal such that g(n(po,o'2”)) < 2*+". Then, for all 
i € {1,...,n}, Lemma 6.12(6.12) gives n(po,@'2") < n(po,@’)2'B’; this, 
combined with g(2R) < 2g(R), gives 


g(n(po, @'2")) < 332*** B’, 


By minimality of n, we have g(n(po,@/2"—!)) => 2**+"-1. because g is 
increasing, we get 


1 
g(n(po, @'2")) = ee 


We have proved the claim (1.20), with A = 2~!~” and B = 3°B’. 


Remark 6.13. The construction of w from f is algorithmic, in the 
following sense. The initial point po may be computed to arbitrary 
precision by an algorithm. If there exists an algorithm that computes 
values of f, then there is an algorithm that, with k € N as input, 
computes g(n(po,@0...@,—1)) to arbitrary precision; so there is an 
algorithm that computes the digits of a. 
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It then follows via Theorem 1.10 that the groups G, and W,(H) = 
Af 2vx Gz are recursively presented, for recursively presented H. 


6.3. T[lustrations 


We now consider illustrations of Theorem 6.2, and examples of growth 
functions that may occur for groups W,(C2). In fact, the examples 
can be constructed in both directions: either choose a “nice” function 
f that satisfies (1.17), or choose a “nice” sequence w and estimate the 
corresponding growth using Corollary 6.10, Lemma 6.11 and Lemma 
6.12. We follow both approaches. 


e For every a € [log 2/ log 7+, 1], there is a group of growth ~ exp(R®). 
Furthermore, for a dense set of a in that interval, there exists a 
periodic sequence w such that W,,(C2) has growth ~ exp(R®). 

For every a < 6 € [log 2/logn+,1), one may construct a function 
f satisfying (1.17) that coincides, on arbitrarily large intervals, 
sometimes with the function exp(R®%) and sometimes with the 
function exp(R*). Therefore, there is a group whose growth function 
accumulates both at exp(R®) and at exp(R®). This recovers a result 
by Brieussel, see [12]. 

e There exist groups of growth ~ exp(R/ log R), of growth ~ exp(R/ 
log log R), of growth ~ exp(R/ log---log R). 

e Consider conversely the sequence w = (012)2!(012)27(012)2° 
(012)24... Among the first k entries, approximately Vk instances 
of 012 will have been seen; therefore n(po,@o ... @%—1) © gktOvk 
This gives a growth function of the order of 


exp (R/exp(O(1)/log R)). 


Consider next the sequence w = (012)2!(012)27(012)2*(012)2° ... 
Among the first k entries, approximately log k instances of 012 will 
have been seen; therefore, n(p9,@o...@%-1) © 2ktOMlosk | This 
gives a growth function of the order of 


exp (R/(log RYO). 


Consider further the sequence w = (012)22"(012)2?° (012)22" 
(012)22" .... Among the first k entries, approximately loglogk 
instances of 012 will have been seen; therefore, n(po,@o ...@k—1) © 
gk+OU)loglogk This gives a growth function of the order of 


exp (R/(og log RYO), 


These constructions generalise easily to give “nice” sequences 
w such that W,(C2) has growth of the order of exp(R/(log:-- 
log RPO), 
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e Consider the Ackermann function 
n+1 if m = 0, 
A(m,n) = ¥ A(m — 1,1) ifm > 0 and n=O, 
A(m—1,A(m,n—-1)) ifm>Oand n> 0, 


and set B(n)=A(n,n). Then there exists a group whose growth 
function is ~ exp(R/B(R)7!); this last growth function is faster 
than any subexponential primitive-recursive function. 

Considering w = (012)240- (912)244,) (912)2422) gives a 
group W,, whose growth function is subexponential, but at least as 
fast as exp(R/B(R)~!); 


7. Imbeddings and Subgroups 


A classical result by Higman, Neumann and Neumann [39] states 
that every countable group imbeds in a finitely generated group. It 
was then shown that many properties of the group can be inherited 
by the imbedding: in particular, solvability (Neumann-Neumann [61)), 
torsion (Phillips [66]), residual finiteness (Wilson [80]) and amenability 
(Olshansky-Osin [63]). 

Seen the other way round, these results show that there is little 
restriction, apart from being countable, on the subgroups of a finitely 
generated group; even if that group is furthermore assumed to be 
residually finite, amenable or solvable. 

On the other hand, very strong restrictions exist on the subgroups of 
a virtually nilpotent group: they are all finitely generated, for example; 
see Exercise 7.6 

Since by Gromov’s Theorem 2.11 the finitely generated virtually 
nilpotent groups are precisely the groups of polynomial growth, we 
naturally ask what conditions are imposed on the subgroups of a group of 
subexponential word growth. As we shall see, there are essentially none. 

The certainly do not share the above property of virtually nilpotent 
groups: there are torsion branched groups of subexponential growth such 
as the group G,. They contain iterated wreath products, and as per 
Proposition 5.13, also contain infinitely generated subgroups. 

Let us say that a group has locally subexponential growth if all of 
its finitely generated subgroups have subexponential growth. Clearly, 
if G has subexponential growth, then all its subgroups have locally 
subexponential growth. This is the only restriction, and the objective 
of this section is to prove the following result: 


Theorem 7.1 ((5]). Let B be a group. Then there exists a finitely gen- 
erated group of subexponential growth in which B imbeds as a subgroup 
if and only if B is countable and locally of subexponential growth. 


For example, this implies that there is a group of subexponential 
growth containing Q as a subgroup. 
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7.1. Neumann’s Proof 


By way of motivation, we start with the classical result by Higman and 
the Neumanns: 


Theorem 7.2 (Higman, B.H. Neumann and H. Neumann [39]). Every 
countable group imbeds in a finitely generated group. 


We shall not follow the original proof (which proceeds by a sequence 
of “HNN extensions”), but rather that by the two Neumanns [61], 
which uses wreath products. It follows immediately from combining the 
following two propositions: 


Proposition 7.3. Every countable group B imbeds in the commutator 
subgroup |G, G] of a countable group G. 


Proposition 7.4. For every countable group G, there exists a 
2-generated group W such that [G, G] imbeds in [W, W]. 


Proof of Proposition 7.3. Consider the following subgroup G of the 
unrestricted wreath product BZ. The group G is generated by Z and, 
for all b € B, the function j,: Z > B defined by fp(m) = 6”. Denoting 
by ¢ the generator of Z, we see that [t, fy] is the constant function 6; so 
B is in fact imbedded in [¢, G]. 


Exercise 7.5. Could we also have defined f,(m) = 6 for m > 0 and 
fp(m) = 1 for m < 0? What would be the advantages and disadvantages 
of this alternative construction? 


Proof of Proposition 7.4. Consider the following subgroup W of the 
unrestricted wreath product GZ. Denote by u a generator of Z, and by 
{b1, b2,...} a generating set of G. Choose a sparse-enough sequence of 
elements 21, 22,... of Z, and define f: Z > G by f(a) = 0;, all other 
values being trivial. The group W is then W = (f, u). 

Let us spell out below what it means to be “sparse enough”. We write 
Z additively. Because as per Proposition 7.3 we only need to imbed [¢, G] 
in W, we may set f(0) = ¢ and a sufficient condition on the 2; € Z is that 
x #0 for all 4; all x; are distinct; and 2; + 2; ¢ {0, xg} for all 7,7,k EN. 
One then sees that [f, f“ “| is a function supported only at 0, with value 
[t, b;] there. This defines the imbedding of [t, G] in W. 


Note that the group W contains the standard wreath product BZ, 
so always has exponential growth. 

The above construction shows that every countable solvable group 
imbeds in a 2-generated solvable group. 


Exercise 7.6. Show first that not every countable nilpotent group 
imbeds in a 2-generated nilpotent group. 
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Show then that every finitely generated nilpotent group imbeds in a 
2-generated nilpotent group. 


Here is a useful, small improvement on Theorem 7.2: 


Theorem 7.7. Let p > 5 be an integer. Then every countable group 
imbeds in a 2-generated group both of whose generators have order p. 


Proof. Piggybacking on Propositions 7.3 and 7.4, it suffices to consider 
a 2-generated group G = (x, y), and to imbed [G, G] into a 2-generated 
group W with generators of order p. Write Cp) = (t|t?), and define 
f: Cy > G by 


f@=2, fa )=y, fa?) =y7'271, f(g) =1 for all other g € Cp. 


Consider then the group W = (t, ft). It is clearly generated by two 
elements of order p, since in (ft)? all the coordinates contain some cyclic 
permutation of the product x-y-y~ta«~!. Now W contains f, so it also 
contains [f,f‘], which is the function Cp > G taking value [z, y] at 1 
and 1 elsewhere. Furthermore, conjugating [f, f] by an arbitrary word 
in f and f*, one obtains the function taking value an arbitrary conjugate 
of [z, y] at 1; so W contains [G, G]@1. 


Exercise 7.8. Where have we used the assumption that p > 5? Can 
you improve the above result to arbitrary p > 3? Can you imbed any 
countable group into a group generated by an involution and an element 
of order p? 


7.2. Finite-Valued Permutational Wreath Products 


Our goal is, starting from a countable group B locally of subexponential 
growth, to construct a finitely generated group W of subexponential 
growth containing B. We take inspiration from Neumann’s proof given 
above, with two modifications: first, we consider permutational wreath 
products rather than standard ones; secondly, we consider finite-valued 
permutational wreath products: 


Definition 7.9. Let H be a group acting on a set X, and let H be 
a group. Their finite-valued permutational wreath product is the group 
A ue G, defined as the extension of functions X — H with finite image 
by G: 


HY G = {(,9) € HX x G: #G(X) < oo}. K 


Note that it is a subgroup, because if (6, 9)~!(¢',9') = (¢”,97'9’) 
then $”(X) € 6(X)~!4'(X) is finite. Clearly, we have 


Vie aCe ee tea Be (6 ek 
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We also introduce a condition on imbeddings that guarantees control on 
growth: 


Definition 7.10. Let B be a group. A group G is called hyper-B if it 
is a directed union of finite extensions of finite powers of B. A 


Clearly, if a group B is locally of subexponential growth and a group 
G is hyper-B, then G is also locally of subexponential growth. Indeed, 
for every finite subset S of G there is a finite extension of a finite power 
of B that contains S. 


Lemma 7.11. Let G be a hyper-B group, and let H be a hyper-G 
group. Then H is hyper-B. 


Proof. Consider h € H; then h belongs to a finite extension of a finite 
power of G, which may be assumed of the form G?F for a finite group F. 
Let us write h = of with ¢: F > Gand f € F; then ¢(/) belongs for all 
f € F to a finite extension of a finite power of B, which can be assumed 
to be the same for all f. This extension may be assumed to be of the form 
BE for a finite group E. It follows that h belongs to Bip, pr (EF? F), 
a finite extension of a finite power of B; so H is hyper-B. 


Lemma 7.12. If H is a hyper-B group and U is locally finite, then 
Hi: U is a hyper-B group. 


Proof. We first show that H xY- U is hyper-H. By hypothesis, U is a 
directed union of finite subgroups FE. The partitions Yo of U into finitely 
many parts also form a directed poset; and for every such partition Apo 
and every finite subgroup E < U there exists a finite partition FY of 
U that is invariant under EF and refines Yo, namely the wedge (= least 
upper bound) of all E-images of Wo. 

Consider now the directed poset of pairs (F, Y) consisting of finite 
subgroups E < U and E£-invariant partitions of U. Consider the 
corresponding subgroups H? x E of H&E U. If (L,Y) < (E',F’) 
then H” » E is naturally contained in H”’' x E’, so these subgroups of 
H2&Y- U form a directed poset, which exhausts H &Y- U. 

It follows that HEY U isa hyper-H group, and we are done by Lemma 
7.11. 


7.3. Imbedding in the Derived Subgroup 


Our main goal, in this section, is to prove the following proposition, 
which replaces Proposition 7.3. 


Proposition 7.13. Let B be a group. Then there exists a hyper-B 
group G such that [G, G] contains B as a subgroup. 

If B is infinite, then G may further be supposed to have the same 
cardinality as B. 
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Lemma 7.14. Let B be a group. Then there exists a subgroup C of B, 
containing [B, B], such that B/C is torsion and C'/[B, B] is free abelian. 


Proof. B/|B, B| ®z Q is a Q-vector space, hence has a basis, call it X. 
It generates a free abelian group ZX within B/|B, Bl], whose full 
preimage in B we call C. Then B/C @z Q=0 so B/C is torsion. 


We set up the following notation for the proof of Proposition 7.13. 
We choose a subgroup C' < B as in Lemma 7.14 and write T := B/C. 
We choose a basis X of C/[B, B], for every x € X we choose an element 
bz € C' representing it, and we define a homomorphism 0;: C > (bz) 
C B, trivial on [B, B], by 6c(bz) = b+ and 6,(by) = 1 forall y Axe X. 
In particular, we have for all be C 


b- | [ (6) € [B, B] 
cexX 
and the product is finite. 
We write 7: B > T the natural projection, and define a section o of 
xz with the following property, which we single out as a lemma: 


Lemma 7.15. There exists a set-theoretic sectiona: T > B such 
that, for every t € T, the subset {a(tu)o(u)—! : u € T}[B, B] of B/[B, B| 
is finite. 


Proof. Because every abelian torsion group is the direct sum of its 
p-subgroups, we may first define the section on each of T’s 
p-components, and then extend it to T multiplicatively (in any order). 

We therefore suppose that JT is a p-group. Recall the notation 
Qn(T) = {t € T : t?” = 1}. Each quotient Vp := Qn(T)/Qn-1(T) 
is a vector space over F,, and the homomorphism t + t? induces an 
injective linear map V;, > Vyn—1. Choose inductively subsets X1, X2,... 
of T such that X, maps to a basis of V,, and such that t + ¢? induces 
an injective map Xp) > Xn-1. Set X = U,, Xn, and give an arbitrary 
total order to X. Choose for each x € X a m-preimage o (x) € B. 

Given that T is torsion, every element t € T belongs to Q,(T) for 
some n € N, so can be uniquely written as a product t = 2]!---+ 2” 
with ordered z; € X and 0 < a; < p for all 7. Extend then o by 
a(t) = o(21)%1 +++ O(a). 

Consider now t € T, and write it in the form t = a}? --- am" as above. 
Extend {21,...,2n} to a finite subset Y = {21,...,2n,..., 25} of X, by 
adding all p/th powers of all 2; to it. The set 


T! = {a} ---af° :0< yi <p) 


is then a finite subgroup of T. Consider next u € T, and note that it 
may be written uniquely in the form u = yf Pr with yg e X\ Y 
and z € T’. Then tu = yp Bish yom (zt) and this representation is unique; 


so o(tu)o(u)—![B, B| belongs to the finite set {o(t’)[B, B] : t! € T’}. 
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Exercise 7.16. Rephrase Lemma 7.15 in terms of the cohomology of 
T with coefficients in ZX. 


Let F be a locally finite group of cardinality > #X, and fix an 
imbedding of X in F'\ {1}. As a first step, we consider the group 
Go = BY: (T x F), and define a map ®9: B > Gp as follows: 


b if f =1, 

@o(b) = (¢, 2(6), 1) with (t, f) = Of (a(t) ba (tx (b))—1) iff eX, 
1 otherwise. 
(1.21) 


Lemma 7.17. The map ®o is well defined and is an injective 
homomorphism. 


Proof. To see that ®o is well defined, note that the argument 
o(t)bo (tx(b))~4 belongs to ker(7) = C, so that 67 may be applied 
to it. Note then that, by Lemma 7.15, the expression a (t)bo (tx (b))~! 
takes finitely many values in C/[B,B], so that ¢(t,f) takes finitely 
many values for varying ¢ and fixed f. Finally, 67(o (t) bo (t2(b))~1) = 1 
except for finitely many values of f € X. In summary, the function 
¢ € B™*F is such that @(t,f) takes only finitely many values. 

It is clear that ®g is injective: if 6 # 1 and ®9(b) = (¢,z2(b), 1), 
then ¢(1,1) = 6 4 1. It is a homomorphism because all 67 are 
homomorphisms. 


Lemma 7.18. We have ®o(C) < [Go, Gol. 


Proof. If b € [B,B], then clearly ®o9(b) € [Go, Go]. Because C' is 
generated by [B, B] U {br}cex, it suffices to consider b = bz. 
We define g € Go by 
b iff=1 
g= (10 with wep yr ETE” 


otherwise. 


Then o(b;) = (¢,1,1) with (t,1) = b, and ¢(t,z) = b=!, all other 


wt? 
values being trivial, according to (1.21); so, as was to be shown, 


Oo(bz) = (6,1, = Ce" - ¥,1,D =[0, 1,271), g] € [Go, Gol. 
We finally define 


G = Got Q/Z) 
and amap ®: B > G by 
®(b) = (¢,0) with ¢(r) = ®g(6) for all r € Q/Z. 


Lemma 7.19. The map ® is an injective homomorphism, and ®(B) < 


[G, G]. 
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Proof. Clearly ® is an injective homomorphism, as ®g is an injective 
homomorphism by Lemma 7.17. 

We identify Q/Z with QN [0,1). For every n € N, consider the map 
W,: B— G defined by 
. ®o(b if re |0,1/n), 
Wn(b) = (G0) with o(r) = 40 ae 

1 otherwise; 

so ® = Wy. We know from Lemma 7.18 that W,(C) is contained in 
[G, G]. 

Consider now b € B. Since B/C is torsion, there exists n € N such 
that b” € C. We define g € G by 


g = (W,0) with w(r) = &o9(b)"'™! for r € [0,1I)NQ. 


Let us write h = ®o(b), and consider the element [(1,1/n), g]|-¥n(b”) = 
(¢,0). If r € [0,1/n) then 6(r) = W(r -— 1/n)~tw(ryh” = h, while if 
r € [1/n,1) then o(r) = w(r — 1/n)-!w(r) = h; therefore 


®(b) = [(1,1/n), g] - Yn(b") €[G, G]. 


Proof of Proposition 7.18. The first assertion is simply Lemma 7.19. 
For the last one: if B is infinite, we wish to find a subgroup H of G 
with the same cardinality as B, such that ® maps into [H, H]. For each 
b € B, choose a finite subset $, of G such that ®(b) € [(S5), (Sp)], 
and a subgroup Gp, containing S$), that is virtually a finite power of B. 
Consider the group H generated by the union of all the Gy. As soon as 
B is infinite, all G, have the same cardinality as B, and so does H. 


7.4. Spreading, Stabilizing, Rectifiable Sequences 


We also extend Proposition 7.4 by replacing the wreath product GxZ by 
permutational wreath products of the form Gwx P for a group P acting 
on a set X, and considering subgroups of the form W = (f, P) for a 
function f: X > G. 

We already encountered in Proposition 6.8 sufficient conditions for 
such a group W to be of subexponential growth, in case f is finitely 
generated. As we shall see, the group W may also have subexponential 
growth if f is infinitely supported, but its support is sufficiently sparse, 
in a sense that we describe now. 

In this section, we assume a finitely generated group P = (S) acting 
on the right on a set X has been fixed. We use the same notation for X 
as a set and as a Schreier graph, namely as the graph with vertex set X, 
and for all sc € X,s € S an edge labelled s from x to zs, see Definition 
1.8. We denote by d the path metric on this graph. 


Definition 7.20. A sequence (29, 21,...) in X is spreading if for all R 
there exists N such that if 7,7 => N andi #7 then d(%, aj) = R. A 
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Example 7.21. If all x; lie in order on a geodesic ray starting from 2 
(for example, if X itself is a ray starting from 2g) and for all i we have 
d(x, %i41) = 2d(x, 2%), then (a;) is spreading. 


Exercise 7.22. Show that a sequence (20, 21,...) in X is spreading 
if and only if for all R there is N such that if 7 # 7 andi > N then 
d(x, 75) = R. 


Definition 7.23. A sequence (2;) in X locally stabilises if for all R 
there is N such that if 1,7 => N then the S-labelled radius-R balls 
centered at 2; and x; in X are isomorphic as labelled graphs. A 


Definition 7.24. A sequence of points (z;) in X is rectifiable if for all 
i,j there is g € P with ajg = 2; and agg A 7% for all k ¢ {i, €}. A 


For example, if X = Z and P = Z acting by translations, then © = 
{2' : i € N} is rectifiable, as 27 — 2¢ = 26 — 2* only has trivial solutions 
t=k,j=landi=j,k = £. It is also spreading and locally stabilizing. 


Exercise 7.25. Show that the sequence © = (2;) C X is rectifiable if 
and only if for all 7,7 there exists g ¢ P with ajg = 7 and UN Lg Cc 
{aj} U fixed points(g). 


Definition 7.26. Fix a point z € X. A sequence (g;) in P is 
parallelogram-free at z if, for all 1,j7,k,@ with « A 7 and gj A k and 
k A£ and ¢ 47 one has 29; ' 939, Ge # 2. A 


Lemma 7.27. IfzeX and (gi) ts parallelogram-free at z, then (zg; ') 
is a rectifiable sequence in X. 


Proof. Set x = zg; for all 1 e N. Given 7,7 € N, consider g = G9) 


so xg = 2;. If furthermore we have rpg = a, then we have zg, . Gi 9; 
ge = 2, so either k = 7, or 1 = j, which implies k = @, or 7 = €, which 
implies k = i, or €= k. In all cases k € {7, £} as was to be shown. 


It is clear that, if P is finitely generated and X is infinite, then 
it admits spreading and locally stabilizing sequences. Indeed every 
sequence contains a spreading subsequence, and every sequence contains 
a stabilizing subsequence, and a subsequence of a spreading or locally 
stabilizing sequence is again spreading, respectively locally stabilizing. 
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We will content ourselves with the following rectifiable sequences, 
based on the Grigorchuk groups G, = (a,b,c,d), with w € Q’; for 
example, the first Grigorchuk group Go12. Recall the description of 
G» from §4.5, and, in particular, the Schreier graph of its action on 
°° in (1.12). We construct explicitly a spreading, locally stabilizing, 
rectifiable sequence for the action of G,» on X: for all 7 € N, let us define 


a%j = 1%0', 
the point at distance 2* from the origin on the Schreier graph. 


Lemma 7.28. For all w € Q containing infinitely many 0,1,2s, and 
for all i,7 EN, 


1. the marked balls of radius 2°") in X around x; and Tj coincide; 

2. the distance d(aj, xj) is (2? — 23]; 

3. there exists gi,j € Ge of length |2*9—2)| with LGijg = tj and cy g4,5 F 
xe for all (k, £) A (4,9). 


Proof. (1), (2) Consider the map 6, from Equation (1.16). It maps the 
stabilizer of 1° in Gow to the stabilizer of 1° in G,, and therefore defines 
a self-map of X by sending 1° g to 1°6,(g). A direct calculation shows 
that it sends x € X to <0. 

Given that 6, is 2-Lipschitz on words of even length in {a, b,c, d}, 
it maps the ball of radius n around z to the ball of radius 2n around 
x0. Its image is in fact a net in the ball of radius 2n: two points at 
distance 1 in the ball of radius n around z will be mapped to points 
at distance 1 or 3 in the image, connected either by a segment _%_ or 


by a path Soy a for some {x,y} C {b,c,d}. In particular, the 
y 


2”-neighbourhoods of the balls about the 2, coincide for all m > n. 

(3) Note, first, that there exists g;,; with xj 9;,; = 2j, because the rays 
ending in 1° form a single orbit. Note, also, that we have x, gj; = ae 
for either finitely many (k,€) 4 (2,7) or for all but finitely many (k, £), 
because there is a level N at which the decomposition of gj; consists 
entirely of generators; if the entry at 0 of gi is trivial or “d”, then all 
but finitely many of the 2, are fixed; while otherwise (up to increasing 
N by at most one) we may assume it is an “a”; then oV+t1 4; 5 = 0%1, 
so tp A wv for all k > N +1. 

We use the following property of the Grigorchuk groups G,: for every 
finite sequence u € {0,1}* there exists an element hy, € G, whose fixed 
points are precisely those sequences in {0,1} that do not start with u. 
One may take for hy the element [a, 6], [a,c] or [a, d] inside the copy 
of the branching subgroup of G, that acts on {0,1}°u, see Proposition 
5.12. 

If the entry at 0% of gi is trivial, then we multiply g;,; with hau for 
some M > max(N, 71), so as to fall back to the second case. 
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Then, for each pair (k,¢) € (2,7) with z,9;,; = a, we multiply 9; 5 
with hygc, to destroy the relation zr, 9;,; = ae. 
The resulting element 9; ; satisfies the required conditions. 


7.5. Subexponential Growth of Wreath Products 


The next step in the proof is an argument controlling the growth of a 
subgroup of the form W = (P,f) < GwxP, for a function f: X > G 
with sparse-enough (but infinite!) support. 

We select a finitely generated group P of subexponential growth acting 
on a set X with sublinear inverted orbit growth (see Definition 6.4; 
recall, from Exercise 6.5, that the property of having sublinear orbit 
growth is independent of a choice of basepoint) and subexponential 
inverted orbit choice growth. These are the hypotheses for Corollary 
6.9, which guarantee that G2x P has subexponential growth as soon 
as G has subexponential growth. The main example we have in mind 
is a Grigorchuk group P = G, acting on X = 1°P, for w € {0,1,2}% 
containing infinitely many times each symbol, see Lemma 7.28. 

We also assume that there are rectifiable sequences in X, and (using 
the results of the previous section) we fix a rectifiable, spreading, locally 
stabilizing sequence (29, 2%1,...) of elements of X. 

Finally, we fix a countable group G, and a finite or infinite sequence 
of elements (61, 62,...) generating G. 


Definition 7.29. For an increasing finite or infinite sequence 0 < 
n(1) < n(2) <... of integers, define f: X > G by 


f(@nqy) = 51, f(tn@)) = 62, ee f(z) = 1 for other z. 
The group Wn) = Wna),nQ),... is then defined as the subgroup (P, f) of 
the unrestricted wreath product Gey P. A 


If all but finitely many 06; are trivial, then Wn) has subexponential 
growth as then G is finitely generated and Wn) is a subgroup of G2x P. 
However, if (n(7)) is sparse enough, then W(,) may have subexponential 
growth even if f has infinite support: 


Proposition 7.30. If G has locally subexponential growth, then 
there exists an infinite sequence (n(t)) such that the group Win) has 
subexponential growth. 


The proof of Proposition 7.30 follows from a stronger, and indepen- 
dently interesting, statement: arbitrarily large balls in Wn) are approx- 
imable by groups of the form W(nc1),...,n(i)), Which have subexponential 
growth by the remark above: 


Proposition 7.31. Assume that the sequence (x;) in X is spreading 
and locally stabilizing, and that all elements b; have the same order. 
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Then for every increasing sequence (m(i)) there exists an increasing 
sequence (n(i)) with the following property: the ball of radius m(i) in 
Wn) coincides with the ball of radius m(i) in Wrct),n(Q),....n@), via the 
natural identification P <> P,f < f between Wn) and Wrciy,....nii- 

Furthermore, the term n(t) depends only on the previous terms 
n(1),...,n( — 1), on the initial terms m(1),...,m(i), and on the ball 
of radius m(t) in the subgroup (b1,...,bj;-1) of G. 


Proof. Choose n(i) such that d(2;,7,%) = m(2) for all 7 A k with 
k > n(i), and such that the balls of radius m(z) around %p(j) and 2; 
coincide for all 7 > n(i). 

Consider then an element h € Wyn) in the ball of radius m(z), and 
write it in the form h = (c,g) with c: X — G and g € P. The function 
c is a product of conjugates of f by words of length < m(i). Its support 
is therefore contained in the union of balls of radius m(z) — 1 around the 
x;, with j either > n() or of the form n(k) for k < 7. In particular, the 
entries of c are in (bj,..., bj;-1) U U5 (03). For j > n(z), the restriction 
of c to the ball around 2; is determined by the restriction of c to the ball 
around zp (j), via the identification b; +> 6;, because the neighbourhoods 
in X coincide and all cyclic groups (b;) are isomorphic. 

It follows that the element h € W is uniquely determined by the 
corresponding element in Wn(1),....n(i)- 


Proof of Proposition 7.30. Let Z = (z) be a cyclic group whose order 
(possibly oo) is divisible by the order of all the bjs. We replace G by 
G x Z and each b; by bjz, so as to guarantee that all generators in G 
have the same order. 

Let €; be a decreasing sequence tending to 1. We now construct a 
sequence m(z) inductively, and obtain the sequence n(i) by Proposition 
7.31, making always sure that m(i) depends only on m(¥j),n(j) with 
j<i. 

Denote by v; the growth function of the group Wnrc1),...,nci). Because 
the group Wac1),...,n(i) is contained in G 2x P, it has subexponential 
growth. Therefore, there exists m(z) such that 

v(m(i)) < EP. 
By Proposition 7.31, the terms n(¢+1), n(¢@+2),... can be chosen in such 
a manner that the balls of radius m(2) coincide in Win) and Wnrc),....n@)- 

Denote now by w the growth function of Win). We then have 

w(m(t)) < ecw for all i € N. Therefore, 


w(R) < ef *™® for all R > m(i), 


so limsup </w(R) < «; for all i € N. Thus, the growth of Win) is 
subexponential. 
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Finally, the rectifiability of the sequence (2;) guarantees that functions 
with singleton support and arbitrary values in |G, G] belong to Wn) for 
all sequences n: 


Lemma 7.32. If the sequence (x) is rectifiable, then |[W(n), Win)| 
contains [G, G] as a subgroup for all choices of n = n(1) < n(2) <---. 


Proof. We denote by 1: G => GX x P the imbedding of G mapping the 
element b € G to the function X — G with value 6 at 29 and 1 elsewhere. 
We abbreviate W = Wn). We shall show that [W, W] contains 
u([G, G]). For this, denote by H the subgroup «([G, G]) N[W, W]. 

We first consider an elementary commutator g = [b;, bj]. Let gi, 9; € P 
respectively map 2;,2z; to 2, and be such that U9)" maps no x to x 
with k 4 @, except for 4G; = 2;. Consider [f%, f%] € [W, W]; it 
belongs to G*, and has value [j, b;| at a and is trivial elsewhere, so 
equals (g) and therefore i(g) € H. 

We next show that H is normal in G*. For this, consider h € H. 
It suffices to show that A‘) belongs to H for all i. Now A!) = nf” 
belongs to H, and we are done. 


We are now ready to complete the proof of Theorem 7.1. By 
Proposition 7.13, the countable, locally subexponentially growing group 
B imbeds in [G, G] for a countable, locally subexponentially growing 
group G. Let (01, be,...) be a generating set for G. By Proposition 7.30, 
there exists an increasing sequence (n(z)) such that the group W = Win) 
has subexponential growth. By Lemma 7.32, [G, G] imbeds in [W, W], 
so B imbeds in [W, W] and we are done. 


Exercise 7.33. Give examples of sequences (2;) that are only spread- 
ing, or only stabilizing, and such that W(n) has exponential growth, even 
when the sequence (n(7)) grows arbitrarily fast. 


8. Groups of Non-Uniform Exponential Growth 


Recall that vg,s(R) denotes the growth function of a group G generated 
by a finite set S. The volume entropy of (G, S) is 


eesti Soe 
cae R>o R , 


The limit exists because the function vg_g is submultiplicative (namely, 
vg,s(R1+ R2) < va,s(R1)vG,s(R2).) Indeed, apply the following lemma 
to the sequence (log vg,s(”))n>1: 


Lemma 8.1 (Fekete [22]). Let (an) be a subadditive sequence: dnt+m < 
Qn + am for allm,n>1. Then lim ap/n exists and equals inf an/n. 
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Proof. Set A = infan/n and consider any B > A. Choose k > 1 such 
that az /k < B. Every n € N may be written in the form n = rk+s with 
ré€Nand se {0,...,4 —1}. Thus, 


On _ Urkts _ Tak TGs _ aN 8 as 


n no. on kon n’ 


so limsupnsoo Gn/n < ap/k < B. Since B > A was arbitrary, 
lim sup ay,/n < L and we are done. 


Furthermore, the following are equivalent: G has exponential word 
growth; Ags > 0 for some generating set S; and A4G,5 > 0 for all 
generating sets S. 

Let G be a group of exponential growth. Note that, even though 
X@,s > 0 for all S, one might have Ags, > 0 along a sequence of 
generating sets 5;. It is easy to see that this cannot happen for G a free 
group of rank k > 2; indeed then each 5; contains a subset of cardinality 
k generating a free subgroup, so Ag_\g, = log(2k — 1) > 0 for all 7. Let us 
say that a finitely generated group G has uniform exponential growth if 
infg Ag,g > 0, and non-uniform exponential growth if Ag,s > 0 for all S$ 
yet infgrg.g =0. 

The existence of groups of non-uniform exponential growth is asked 
by Gromov in [35, Remarque 5.12]; see [38] for a survey. There have been 
quite a few positive results: Osin showed in [64] that virtually solvable 
groups have uniform exponential growth unless they are virtually 
nilpotent; Eskin, Mozes and Oh obtained the same result in [20] for 
finitely generated linear groups in characteristic 0; Koubi showed in [49] 
that word-hyperbolic groups have exponential growth unless they are 
virtually cyclic. 

A Golod-Shafarevich group is a residually-p group such that the 
associated Hopf algebra +0 w”/o"+! from §5.1 has exponential 
growth; equivalently, the Lie algebra @,,51 Yn(@)/yn+1(G) from (1.13) 
has exponential growth. Since by Proposition 5.6 the growth of kG is 
always a lower bound for the growth of G, such groups have uniformly 
exponential growth. 

Among groups G of uniform exponential growth, one may ask whether 
the infimal entropy infisy-G4@,g is realised. Recall that a group G is 
Hopfian if it is not isomorphic to a proper quotient of itself. Sambusetti 
proved in [69] that if G is a free product, G = G) * Go with G) non- 
Hopfian and G2 non-trivial, then infig)-GAq,g is not attained. 

It was widely suspected, since the appearance of groups of intermediate 
growth, that examples of groups of non-uniform exponential growth 
should exist. The first examples of groups of non-uniform exponential 
growth were exhibited by Wilson, see [81]. We now give a simple 
construction showing that such groups abound: 
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Theorem 8.2 ([7, Theorem E]). Every countable group may be 
imbedded in a group of non-uniform exponential growth. 

Furthermore, let ny * 2.46 denote the positive root of the polynomial 
T?—T?-2T—A4. Then the group W in which the countable group imbeds 
may be required to have the following property: there is a constant K such 
that, for all R > 0, there exists a generating set S of W with 


uw,s(r) < exp(Kr'°82/!08"+) for all r € [0, RI. 


Proof. Let B be a countable group. By Theorem 7.7, one may imbed B 
into a group G generated by two elements s,¢ of order 5. Without loss 
of generality, assume that G has exponential growth (if needed, replace 
first B by B x Fy). 

The group W in which G imbeds is the wreath product G?x Go12 of G 
with the first Grigorchuk group. We also consider A = Cs x Cs = (s’, t’), 
and the wreath product W’ = Ax Go12. 

We consider the points 29 = 0© and x; = 0% 10° for all i > 1 in the 
Schreier graph X, and the generating sets $; = {a, b, c, d, s@zo, tQz;} of 
W and S’ = {a, b,c, d, s’@zp, t/'@zx} of W’. 

Note that the sequence (x;) is spreading and locally stabilizing (see 
Definitions 7.20 and 7.23); better, for all R the radius-R balls around 
xo and x; in X are isomorphic as labelled graphs for all i large enough, 
because the action on a sequence € {0,1}° of an element of Goi2 of 
length R depends only on the last [logg(R)] symbols of the sequence. 

We now claim that, for all R € N, there exists 7 such that the balls 
of radius R in the Cayley graphs of (W,5;) and (W’,S’) coincide. 
By Theorem 6.3, there exists a constant K such that, for all R €¢ N, 
the ball of radius R in the Cayley graph of (W’, 5S") has cardinality 
vw7s'(R) < exp(KR°82/!087+). Assuming the claim, we get vy,s,(R) < 
exp(KR!82/1l08"+) from which the second claim of the theorem follows. 

For every « > 0 there exists R such that KR!°S?/l98"+ < eR, so 
vw,s,(R) < exp(€R) and Aw, gs, < € for some 7. It follows then that W 
has non-uniform exponential growth. 

It remains to prove the claim. Given R € N, let 7 be large enough 
so that the distance between xz and 2; in X is at least 2R. Consider 
a word w in S; of length < R, and let w’ be the corresponding word 
in S’ obtained by replacing sQ@2o, {@z; respectively by s’Qzp, t'@2o. We 
show that w represents the identity in W if and only if w’ represents the 
identity in W’. 

Write w = (c,g) in W, with c: X — Gand g € Goia. Similarly, write 
w’ = (c',g) in W’, with c’: X > A and the same g € Go12. Note that 
the support of c is contained in the union of the balls of radius R around 
xo and x;, and these balls are isomorphic and disjoint. Therefore, c can 
be written in the form c = cic2 with cy: X > (s) and c: X => (t), 
and cj, c2 have disjoint support so they commute. The function c’ may 
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correspondingly be written as c! = c}cy with c,: X > (s‘) obtained 
by composing cj with the isomorphism (s) > (s’), and ch): X —> (t’) 
obtained by composing the isomorphism from the radius-R ball around 
xp to the radius-R ball around 2;, the map c2, and the isomorphism 
(t) > (t’). Therefore, c’ = 1 if and only if c = 1, so the balls in W and 


W’ are isomorphic. 
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Abstract 

We study the decay of convolution powers of probability measures 
on some discrete groups that are not finitely generated such as the 
infinite symmetric group G = S®©) and algebraic direct sums of finitely 
generated groups G = )°Tj, e.g., with [; = Z/p;Z for some sequence of 
integer p; or Tj = Z4 for all i. 
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1. Introduction 


Let G be a countable group equipped with the counting measure. For 
any finite subset B of G, we let |B| be the number of group elements 
contained B. Let @ be a probability density on G. We say that @ is 
symmetric if (4) = ¢(a—1),x € G. The measure @ induces a random 
walk on G that starts at the neutral element e and whose law at 
time n is the nth convolution power ¢™. Understanding the large 
time behavior of the functions n > ¢™(e) and (n,2) BH ¢™(z) is 
of interest from many different viewpoints. For instance, the random 
walk driven by @ is recurrent if and only if oe) = oo, the 
group G is nonamenable if and only if there exists a symmetric ¢ such 
that lim SUP ns 00 (6 (e)) 1” < 1. In many cases, the behavior of the 
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probability of return, 6 (e), captures important information about the 
algebraic properties of the group. When ¢ is symmetric, nb $2” (e) is 
nonincreasing and tends to zero unless the support of @ generates a finite 
group. Roughly speaking, the larger the group and the more spread-out 
the measure, the faster 6 (e) decays. 

On a finitely generated infinite group, symmetric random walks driven 
by a probability density with finite generating support form a natural 
class of random walks. For results describing the behavior of n g™ (e) 
in this context, we refer the reader to [15-18, 21, 22]. The fundamental 
result of [16] asserts that the behavior of the probability of return of 
a random walk in this natural class is, in some sense, an invariant 
of the finitely generated group that supports the walk. Some results 
regarding spread-out measures on finitely generated groups are given in 
(2, 5, 16]. 

The present work focuses on random walks on countable groups that 
are not finitely generated. A fundamental difference with the case of 
finitely generated groups is that, on a general countable group (e.g., on 
Q), there exists no obvious “canonical” class of random walks. Instead, 
depending on the particular structure of the group, there might be large 
natural families of random walks that present a continuum of behaviors. 
One of the most basic examples of countable nonfinitely generated group 
is the group Re of rational numbers of the form m2—*, m € Z,k = Z. A 
natural symmetric generating set is S = {42-* k = 1,2,...}. For any 
probability distribution p = (pz) on {0,1,2,...}, the symmetric proba- 
bility density dp = podo} + er EE (6(9-kj + 8;_9-ny) defines a natural 
random walk on Rg. The transience and recurrence of such random walks 
is studied in [9]. Unfortunately, we will have very little to say about this 
example except to point out that little is known about the behavior 
of 6 (e) and how it depends on p. The fundamental structure of Ro 


emphasized above is that Ro is the increasing union Ro = UZ of 
copies of the finitely generated group Z. Any countable group is (in many 
different ways!) the increasing countable union of a family of finitely 
generated groups, but this type of structure is not always very natural. 
When a countable group G is the increasing countable union of a 
collection of finitely generated groups Tj, G = Uje;T';, indexed by a 
countable ordered set J, it is natural to pick a symmetric probability 
density ¢; on each I; (say, with finite generating support) and consider 
the walks driven by the probability densities dp = se 1 PiPi, Where 
p = (pi) is a probability distribution on J. Unless some additional 
structural hypotheses are made, very little is known about how to 
approach the study of the behavior of n a) in this context. 
Earlier works concerning examples of random walks on countable 
groups include [7, 9, 13, 14]. These works contain additional references. 
The works of Kesten and Spitzer [13] and Fereig and Molchanov [9] are 
based on the use of Fourier transform and deal with abelian examples. 
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The works of Lawler and of Bofferio and Woess deal mostly with the 
infinite symmetric group S®), that is, the group of permutations of the 
integers that leave all but finitely many integers fixed. All these works 
focus on the dichotomy between transient and recurrent behaviors. 

The present work is closer in spirit to [3] and [5, 16, 21]. Our aim 
is to obtain results describing the decay behavior of n + $\(e) as 
precisely as possible in a variety of situations. The technique we will use 
is based on two simple ideas: (1) often, there are very specific examples 
that can be studied using rather elementary but ad hoc techniques; (2) 
once a good example (or a collection of good examples) is understood, 
one can use comparison techniques to understand the behavior of many 
more examples including examples that are quite different from the ones 
studied in the first step. 


1.1. Union of Finite Groups 


One interesting special case that we will consider at some length is the 
case when G = U;e;I; and the IT,’s are finite groups. The following 
three examples are of this type. 


Example 1.1. The infinite symmetric group S© is the group of those 
bijections of {1,2,...} that move only finitely many integers. Obviously, 
this is the increasing union of all finite symmetric groups, S°©°) = Ure Si. 
There are subtleties in studying models of random walks on S©). For 
instance, we are able to determine quite precisely which p = (p;) make 
>> pidi recurrent or transient when ¢; is the “random transposition” 
measure on S;, but we are not able to do so when @; is the “adjacent 
transposition” measure. This last case points toward interesting open 
questions. 


Example 1.2. The infinite special linear group with coefficients in 
the finite field Z/pZ can be defined as the increasing union of SL;(p). 
Thinking in terms of matrices, to go from i to i+ 1, add a column to 
the right and a row to the bottom. Identify SL;(p) in SL;41(p) as those 
matrices with 0 entries in the last row and column except for the bottom 
right entry, which is 1. 


Example 1.3. Let B(8,1r) denotes the Burnside group with r genera- 
tors and all element of order 3. This is a finite group of order 36)+@)+(), 
Ordering the generators of each B(3,7), we obviously have consistent 
natural injective homomorphisms from B(3, 7) to B(3,7+1), 7 => 1. Then, 
the group B(3,00) = Uf? B(3,7) is the increasing union of the groups 
B(8, 7). This group comes with a natural infinite sequence of generators 
{g1, 92,--:} such that {g1,...,9;} are the generators of B(3, 7). 


In each of these examples, there are a variety of interesting ways 
to pick symmetric probability densities ¢; supported on Tj. Setting 
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dp = >; Pivi as above, one would like to study how the decay of 
ne os” (e) depends on p and on the choice of the ¢;’s. 


1.2. Direct Sums 


Another special case of interest is when G = )°, G; is the direct sum of 
finitely generated groups G;, 7 € J, that is, 


G= {(gilicer : 9: € Gi, gi = ej for all but finitely many 7}. 
In this case, assuming that J = UjeyJ; with Ij, 7 € J, being an increasing 
family of finite sets, we also have G = Ujez (Diex Gi). 
However, the most natural families of probability densities on G = 


ys 7 Gi associated with the direct sum structure are defined as follows. 
On each Ge, let we be a probability density and set 

; ace: _ J velo) if gi = e; for alli 4 ¢ 
de: G [0,1], 9= Gi) oe(9) = 0 ie 


(2.1) 
Then, for any choice of a probability density p = (p;)7, set 


bp = >> pid. (2.2) 
f 


Here are two simple examples of this situation. 


Example 1.4. Given a finitely generated group I’, let G = I’) be 
the sum of countably many copies °; of I. In this case, it is very natural 
to pick a probability density w on I’, set %; = w on each I’; and consider 
the probability densities 6; and dp = 0 pig; on G defined by (2.1)—(2.2). 


Example 1.5. Given a sequence of integers mj, 7 = 1,2,..., set Tj = 
Z/[mZ and wi(+1) = 1/2, wWi(z) = 0 if « 4 +1. In this case, the group 
G= ae I; is both a direct sum and an increasing union of finite 
groups. 


1.3. Other Examples 


There are, of course, many other types of examples. One of the simplest 
is the group U(oo) of all (infinite) upper-triangular matrices with 1 
along the diagonal and finitely many nonzero integer entries in the upper 
half. This group can obviously be viewed as the increasing union of the 
upper-triangular nilpotent groups U(n). Let have all nondiagonal 
entries equal to zero except the (7,7 + 1)-entry, which equals +1. The 
elements (ge yield a natural symmetric generating set. Set ¢; = 


4 (Se + 8g, + 64-1). On U(oo), one can consider the family of probability 


measures dp = SP pigs with p = (pi)f° as before. We know little about 


these measures except for the obvious fact that os” (e) < oy” (0) where 
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Pp is the measure naturally associated to p on Z‘%), the direct sum 


of countably many copies of of Z. We know a lot about $y,” (0). See 
Section 6. 

Understanding whether or not the upper-bound of” (e) < $y (0) is 
sharp for some interesting p is an open question. 

Two groups that are much easier to understand than U (oo) are Hj (co) 
and H described below. 


Example 1.6. The group Hj(co) is a nilpotent group of class 2 


generated by the elements 2; ae ~ a = 1,..., and such that all 
pairs (a, 2), (%, yj) and (y, yy) commute except 2;, y;, which satisfies 
[xi, ys] = 2 for all i > 1 with z central. The subscript 1 in Hj(co) 


refers to the dimension of the center. The group H1 (oo) can be viewed 
as the increasing union of the usual Heisenberg groups Hy(2n + 1). 
It can also be identified as the set Z'°) x Z©) x Z with group law 
(z,y,2)-(2,y',2.) = (ata ,yty,zt2’+20-y') where x+y! = OP ay. 
This makes good sense because the sequences z, y’ are eventually 0. 

Let [H1(3)|° be the direct sum of countably many copies of the 
Heisenberg group Aj(3) = U(3). Let ¢; be the probability measure 
defined by oi(a*) = gi(yf') = gi(e) = 1/5 in Hi(o), and set 
dp = > vidi with p= (pi)f° as before. 

Similarly, let dbp (resp. bp) be the measure on [H(3)|°° (resp. the 
measure on [Z?]‘%°, the direct sum of countably many copies of Z?) 
naturally associated with p. Because we have natural surjective group 
homomorphisms 


[H1(3)|© > Hy (00) > [Z?], 


it immediately follows that 


BL” (e) < HS (0) < by. 


This yields good bounds on os” (e) based on the bounds for direct sums 
obtained in Section 6. 


Example 1.7. The group H is also a nilpotent group of class 2. It 
is generated by 27", 1 = 1, where all pairs (aj, 2;) commute except 
(xj, 2141), which satisfies [2j, ej41] = 2 with 2 central, 7 > 1. This is a 
subgroup of [1 (3)|°. Namely, if we write g € [H1(3)|(© as a sequence 
of 3 x 3 upper-triangular matrices g; with upper-entries X;, Y;, and Z;, 
then H is the subgroup of those sequences such that Y; = Xj41,7 > 1. In 
addition, there is again an obvious SIRIECHNE homomorphism onto Z©), 


Let ¢; be the probability measure ¢; = (Se + be, +6 art) on H. Let 
p = (pi)P° be a probability distribution such that there are constants 
0<a<1<b<o such that ap; < pj+1 < bp;. With these notations 


and hypotheses, the comparison techniques of [16] easily yield that, for 
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fixed a, b, there are constants c, C' € (0,00) such that for all p satisfying 
ap; < pit < bp; and all n, we have 


cB (e) < (0) < by” O). 

Here, dbp is the probability measure on Z? naturally associated to p and 
#50 +6, +6_1), and @, is the measure on [Hy (3)| associated with the 
probability distribution p and the measure $(5¢ +42 +5,-1 + 4y +5y-1) 
where x (resp. y) is the matrix in H1(3) with trivial entries except the 
(1,2)-entry (resp. the (2,3)-entry) which is 1. Again, this gives good 
bounds on i) based on the bounds for direct sums obtained in 
Section 6. 


2. Selected Technical Tools 


Let G be a countable group and ¢ be a probability density on G. Set 
t — tn (n) 
a n 
Oe ae ae (2.3) 
0 


ne is the probability density of the continuous time random walk driven 
by ¢@ at time ¢t > 0. The following well-known proposition says that 
Nhe p?2™(e) and th he (e) have essentially the same behavior when @ 
is symmetric. We will make frequent use of this result. 


Proposition 2.1 ([16, Prop. 3.2]). Assume that @ is symmetric. Then, 
forn=1,2,... 


pert) (e) < 2hf (e) and hf, (e) < e?" +o? (e). 
The Dirichlet form associated to a symmetric probability density ¢ is 
1 


fo 9) = 5 


Y= If(ey) — F@/?6(y). (2.4) 
z,yeG 

The following Proposition indicates that if two symmetric probability 
densities @;, 1 = 1,2 have comparable Dirichlet forms f tr &%,(f,f), 
i = 1,2, then the behaviors of n o?” (e), i = 1,2 are essentially the 


same. 


Proposition 2.2 ({16, Lem. 3.1 & Prop. 3.2]). Assume that @;, i = 1,2 
are symmetric and that there is a constant A such that, for all functions 
f with finite support on G, Eg, (f, f) < A€g.(f,f). Then we have 
eVi>0, he) < ho a(e)- 
© For all n =1,2,..., oA"? (e) < 26-2" + 2g CL"/AD (e), 
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Given two nonnegative functions f,g of t > 0 or n = 1,2,... ina 
neighborhood of 0 or infinity, we say that f = g if there are constants 
cy € (0,00), 1 = 1,2 such that cq f(t) < g@) < cf in a neighborhood 
of 0 or infinity. We say that f ~ g if there are constants c; € (0,00), 1 = 
1,...,4 such that cy f(cot) < g(t) < cgf (cat) in a neighborhood of 0 or 
infinity. For instance, f(t) = exp(—t!/?) and g(t) = exp(—5t!/3+t!/442) 
satisfy f ~ g near infinity. We will only use this notation when at least 
one of the functions f or g is monotone. Using this notation, we can state 
the following two-sided version of the previous proposition. 


Proposition 2.3 ((16, Lem. 3.1 & Prop. 3.2]). Assume that ¢;, i = 1,2 
are symmetric and that there are constants a,A € (0,00) such that, 
for all functions f with finite support on G, a&€g,(7f,f) < €,(0/,f) < 
A€g,(f,f). Then we have 


hp?(e) ~ hp (e) at infinity 
and 
62 (e) ~ b12”(e) at infinity. 


Given a probability density ¢ on a countable group G, let Rg be the 
operator of convolution on the right by ¢, Rg: ft Ref = f *¢ acting 
on finitely supported function and, by extension on the spaces L?(G), 
1 < p < w. With this notation, ne is the convolution density of the 


semigroup of operators e U-F¢) | that is, e tU— Re) ¢ =f x Be 


3. Random Walks on Increasing Union of Finite Groups 


This section focuses on group G that can be viewed as the increasing 
union of a countable family of finite groups G;. Namely, we assume 
that there is a countable infinite family of finite subgroups G; C G, 
a= 1,2..., such that G; C Gj for any i < j and G = UPG;. Note 
that we assume that the sequence of subgroups Gj is strictly increasing 
so that G is infinite. See Examples 1.1, 1.2, 1.3 and 1.5 described in the 
introduction. This case is also the case considered in [14] and in [7]. 


3.1. On Diagonal Upper Bounds: Convex Combinations of 
Commuting Probability Densities 


For each i, let v; be a probability density on G; (with respect to the un- 
normalized counting measure). By extension, we view all v; as probability 
densities on G as well. 

In this section, we make the following rather restrictive assumptions. 


(a) for all 1,7 € J, uj * vj = vj * Uj, ie., the v;’s define commuting 
convolution operators on G. 
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(b) For each i there is an integer T; such that suPrea, {hy (x)} < 
C/|G;|. The time T; provides a rough notion of “time to equilib- 
rium” for the random walk driven by v; on the finite group Gj. 


We note that hypothesis (b) does not imply convergence of o® to the 
uniform probability u; on G;, nor even the convergence of hy * to uj. 

For any probability measure p = (p;) on N, we consider the probability 
measure Up on G defined by 


[o,@) 
Up = Se piri. (2.5) 
1 


Because the operators Ry, commute, we have 
e tI Rup) TT] ete Ra) 
I 


and 


* 
h,” (e) = TI ms (e) (convolution product evaluated at e). 
I 


This leads immediately to the following estimate. 


Proposition 3.1. Under the assumptions (a)—(b) introduced above, we 
have 


Ves. th? Ce) 


IA 


cint {|G -4eT, tp > Ti}. 


Example 3.2. On S©) = UPS;, let vj be the random transposition 
measure on S;, that is, vj(z@) = 2/n(n—1) if = (k, 2) with 1<k<@€<i 
a transposition and 0 otherwise. In this case (see, e.g., [19]), 


1 
h,?(e) < C— for t= Ti/pi, Ti = 2ilogi. 
UL 


Suppose that pj ~ i-*%, a > 1. Then we have hy (e) < g for t = 
219+! log i. This implies that (with different C, c) 


Vt>0, h,?(e) < Cexp(—ct/O*® (log t)/O1%), 


As we shall see later, this result is a little bit off and we do not know 
what the behavior of he ®(e) is exactly in this case. 


Remark 3.3. The anonymous referee points out that, when the mea- 
sures v; are symmetric, we have €), > p;€y,; for any 7 and assumption 
(a) can be omitted in the proposition stated above. By the comparison 
techniques of [8, 16] (ie., Proposition 2.2), the assumption (b) alone 
implies the stated conclusion. 


The next subsection proposes a more general and more powerful 
technique. 


RANDOM WALKS ON SOME COUNTABLE GROUPS 85 


3.2. On Diagonal Bounds: Convex Combinations of Haar 
Measures and Comparison 


We now explain how to apply the two basic ideas described in the 
introduction in the context of countable increasing union of finite groups. 
Let G = us G; as before and let u; denote the uniform probability 
density on the finite group G;. For any probability p = (p;) on 
{1,2,...}, set 


KH op = > Pi RS Ayess) 


i>k 


and consider the probability density up on G defined by 
[o,@) 
Up = SS pie: (2.6) 
1 


The behavior of these measures is studied in [3] using the observation 
that 


Up = } Pi, + * Pty, Umax{ty,...,2n} 
tytn 
n 


Hence 


(n) 5G nf 2 -z). oF 
my (= Dd a) (= Gul ae 


2 


We write this in the form of a Laplace transform, namely, 


lee) lee) 
up” (e) — / e ™dF(s) = nf e_"® F(s)ds (2.8) 
0 0 
where 
1 1 1 
F(a) = =— for log —— 4 6 = log ——$ FS 2, Be 
|Ge-1 1— of 1—op-1 


The following less-precise version is often easier to handle. 


Proposition 3.4. We have 


(n) x —ns = ei —ns 
Up C6) / e '’dF(s)= nf e '” F(s)ds 
0 0 
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where 


F(s)= for o, < 8 <op_1, K=1,2,.... 


1 
|Ge_1| 

Many explicit examples illustrating this proposition are discussed in 
[4], and we will discuss some subtle use (2.7) below. What make this 
computation very valuable is that, by using comparison techniques, 
they lead to interesting estimate for other random walk models. See 
Proposition 3.7 below along with its applications. 


Example 3.5. Because Gy C Gy+1, we must have |Gz44| > 2| Gz. 
Hence, if g(k) = | G;|, the slowest possible growth rate of g is exponential. 


ee < Aso that g(s) x e*. Assume further 
that o, = &(k) where & is a continuous decreasing function such that 
sup; aes < A. Under these assumptions, Proposition 3.4 is very easy 


to use given that 


Let us assume that sup; 


Fe : 
Gib 
For instance, if €(s) = cys~% (hence py ~ k~*~!) then 
ug” (e) we ele 
As a second example, consider the case when g(k) =< ceP® and &(s) = 
Cye >. Then 
ug” (e) ~ n ble, 

Next, for each 7, we fix a symmetric probability density ¢; on G; and 
we let A; be the second smallest eigenvalue of J — Rg, on 1?(G;). By 
definition A; is given by 

E¢; 
je int | off) 
If — wih 


Given a sequence (¢;){° as above and a probability law p = (p;) on 
{1,2,...}, we set 


5 if 0?(Gi), f not a constant function| : 


[o,@) 
Pp = S- vidi. (2.9) 
1 
Lemma 3.6. Referring to the above setup and notation and assuming 


that at least one rj is positive, define q = (qi) by setting 


Ap) = Do Asvi, Ge = ACP)" Aas. 
Then we have 


MpEug < Eby < 2Euy - 
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Proof. Simply observe that Ey, = > piEg, and that rAjEu, < Eg, < 2Ex,. 
The last inequality expresses the fact that the nonzero eigenvalues of 
I — Rg, are in the interval [;, 2]. 


Proposition 3.7. Referring to the above setup and notation, there are 
constants c, C' € (0,00) and an integer k such that 


op (e) < Cu?” (e) 


and 


uP (e) < CHP” (e). 


Proof. Applies Lemma 3.6 and Proposition 2.3. 


Let us point out that Proposition 3.7 provides excellent control of bp 
when the family (Gj, @;);¢7 has the property that 4; = 1. We give two 
explicit examples. 


Example 3.8. Consider the infinite dimensional special linear group 
with coefficients in the finite field Z/pZ defined as the increasing union 
of SL;(p). Thinking in terms of matrices, to go from i to i+1, adda 
column to the right and a row to the bottom. Identify SL;(p) in SL;41(p) 
as those matrices with 0 entries in the last row and column except for 
the bottom right entry, which is 1. Let ¢; be the “random transvection” 
measure on SL;(p) (see [10, 19] for a detailed description). Then A; = 1 
and ose) & ug” (e) for every choice of p. 


Example 3.9. M. Kassabov [11] proved that the symmetric groups S; 
can be made a family of expanders by an appropriate choice of symmetric 
generating sets B; C S; of uniformly bounded cardinal. Assuming as we 
may that each B; contains the identity element, let @; be the uniform 
probability measure on B;. Then the associated spectral gap sequence A; 
satisfies A; = 1 and we have os” (e) ~S ug” (e) for every choice of p. See 
also [12], which provides further examples of this type involving finite 
simple groups. 


4. Examples on S“) 


In this section, we write SO) = US’Sn where S,, is the symmetric group 
on {1,...,n} objects. 


4.1. Behavior of Convex Combinations of Uniforms 


Fix p = (pi)n and of = )0;5,% pi. The computation of the previous 
section yields 


lee) 
ug” (e) =f e~"' dF(s) where F(s) = 1/k! if s € [og, 04-1). (2.10) 
0 
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For the case when pp = cyk~% and pp = 2~*t!, the computations are 
straightforward and given in [3]. One obtains 

log n)(log log n) if py = 27*+1 
eee 
08 ty Ce) n4/“ (log n)i-V/o if pe = cgk%, a> 1. 

Next, we treat the more difficult case when (recall that (n+ 1) = n!) 
ke 1 
kl T(k-a—1) 


Pk Ok R 


As we shall see, this case allow us to obtain both recurrent walks and 
transient walks and to capture the transition from one case to the other 
as a function of the parameter a. In this particular context, for s small 
enough, we have 


log log 1/s \“ log log 1/s\*~" 
C18 ewea/2 < F(s) < Cis legless ‘ 
log 1/s log 1/s 
By [6, Theorem 1.7.1’], for n > 2, we have 
log a log | a 
con! (“ece*) < ul (e) < Gn (“Ecs" ) . 
log n log n 
The following result is more precise. 


Theorem 4.1. On S‘%) with pp x Fe, aeéR fired, we have 


log log *) oo 


Vn, ug” (e) < Cn! ( 
log n 


Further, for any fixed constants 0 < c < C < ov, there exists cy > 0 
such that for n € U,(e#, cH), we have 


log log n) ) oy. 


ug” (e) > cn! ( 
log n 
Finally, 
1 
(n) _; (log log n)** 
Vn, Up (e) > con tinge 


and there is a constant cz > 1 such that this lower bound is sharp up to 
a multiplicative constant for integers n in Uk (03 ertsgk> C3 EabgR): 


Proof. The upper bound has already been proved above. For the lower 
bounds, we go back to the formula 


(n) — n 1 oN 5 eM 
Up aT =e) (—<- x) “38 (kK-D! 


1 


RANDOM WALKS ON SOME COUNTABLE GROUPS 89 


Set A, = fi and consider 


! 
Ax = eo MOK-FK+1) ki = ke (Ok FK+1) 


Abel =D! 


For a fixed n, let € = £n be the integer such that 
t! (€+ 1)! 
— <n < ——. 
La (€+1)¢% 
For k € {mq,...,£—1} with mg a sufficient large integer, we have 


Qn(op — Op41) = nog > cop tov-1 X (1+ 1/2)" % > £/2 


for some c > 0. Hence Ag/Agyi < le—/*, For n large enough (hence 
£ large enough as well), Ag/Agy1 < 1/2 for all k € {mg,...,€ — lj. It 
follows that 
e-1 
> Ag < CAe. 
1 


For k > €+1, n(ox — on41) < C. Hence Az/Agii = e~Ck > 2 for n 
large enough and 


lee) 
= Ax < C’ Ags. 
042 


This shows that 
een eon t1 


Gi ty 


tin” (e) = 


In particular, for any fixed constant c € (0,1), and any n large enough 
such that n € (co, +, (cog)~1) for some k, we must have k = fy or 
k = £,, +1. In both cases, it follows that 


u(e) x its got loglogn\ 7! 
P “he De log n : 


However, the union of the intervals (co, ", (cox)—'), k = 1,2,..., does 
not cover all large integers. 


To understand the behavior of u, 


0 (e) when n is away from the special 


values Ge for n€ Gren £= fn, we define t = tn by 


Note that 


-1 -1 
te (1,oeo,,1) and ogo; ,) = &. 
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log log s\“~1 
aie =( og ices) 
S log s 


We also set 


and observe that 
1 
(—1)! 


®(n) x t!@(o,1) xt} 


Because we have 
C1 (ee 4 poratoenied) ug” (e) 


(€—1)! 
< a ai (ee +4 a) 


and that toe+1/oe < c5 and Wp = = d(o, }), we obtain that 


ct (@o + *) O(n) < uy (e) < Ct («* + 7) (n). 


For any fixed b > 0, the function ge(s) = se~*+s/€ satisfies g(s) ~ 1 for 
s near 1 and s near @ and it attained a minimum of order ¢~! log é near 
s = log é. Near 1, ge(s) ~ se~* and on the interval (log @, 2), ge(s) x s/€. 


we obtain that 


Because for any n, = ln ~ Paar 


log] 2 1 flog] 
ul (e) > ed(ny S08 og n) B (*e og n 
n 


a 
) log log n. 
log n log n 


Further, this lower bound is echived (up to a multiplicative constant) 
at values of n of the form n = o, loge. 


4.2. Walks Based on Small Generating Sets of S,, 


In this section, we study random walks on S°) that are of the form 


dp = x DiGi 
1 


where the sequence ¢; is a natural sequence of probability measures ¢; 
where ¢ is supported on the symmetric group S;. One of the most natural 
examples is when 


1l/iift=e 
dit) =} 2/7 ifr=(—,9,lsp<qsi 
0 otherwise. 


This probability measure on S; is known as “random transposition.” 
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It is important to note that the measure dp = )-; pidi can be written 
in many different ways. In particular, we can collect all occurrences of 
the transposition (p,q), p < q, as follows. 


a 20; 1 
O'= (25 AC Le 5 8,9: 


In other words, 


dp = OF = » Bie; 
where 
lee) [o.@) 
P=) 0 7p, Bj =2G-1) > 1 7p, 
1 w>j 
and 


o1 = be,0;=G-D YI bay. 
1<k<j 


The measures w are considered in [7] (model 11). 
In addition to the map p+> p, consider the maps 


pr> ps, (p+)i — ce (p) tL + ity; 


where ci(p) = 0,(1+ i)*!p; (assuming the sum is finite). 
It is well known (e.g., see [19]) that the spectral gap of the measure 
o; and @; are both of order 1/7. Hence, Lemma 3.6 yields 


CONpE,.. Ste, = 2Ey, 
and 


CO_(P)E ry < Ew, < 2&y,. 


p 


Obviously, we also have €g, = ews: This means that 5” (e) ad wy” (e) if 
and only if p; = p;. This will happen whenever p; decays as a deuentistle) 
power function. 


Proposition 4.2 (Random transposition). Referring to the above 
setup and notation on S©), there is a constant c such that for any 


probability p = (pi) 
CO_(P)E rip < by < 2E ug - 
In general, for each p, there are constants C',k such that 
gh” (e) < Cus” (e) 
and 


ui (e) < Coy” (e). 
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In particular, when p is such that pj x (A+ i)! p;,we have 
wy (e) = oF (e) = up (e). 


Corollary 4.3. Referring to the above setup and notation on SO), 
assume that p is such that pj < *. 


1. The walk associated with by is recurrent if and only if a < 3. 
Further, for n € Uj, (ck! k~%, Ck! k-%), 
_1 (loglogn\*? 
oon (mebn) 
og n 
2. The walk associated with wp is recurrent if and only if a < 2. 


Further, for n € L,,(ck! k~%, Ck! k7%), 
if if a-1 
wy” (e) ~n! (“ a) ; 
log n 


Remark 4.4. A typical case when pj; % (1+i)~!p; is pp x A+i)~%, a> 
1. Indeed, in this case, pj = p;. As —log(ug” (e)) ~ n/4(og n)i- Ve, 
Proposition 4.2 gives 

en/OF% dog ny VA+%) < — log(gy” (e)) < Cn! (log n)i-l/*. 


Note that in this case, we also have — log(oy”(e)) ~ log(w” (e)). 


Remark 4.5. In [7], four different models are considered. As mentioned 
above the model j11 in [7] is (essentially) what we call wp here. For the 
other models pj, i = 2,3,4 of [7], we have €,, < CE&,, as long as the 
coefficients p; in these models satisfy 


lim sup p;/pi41 < 1. (2.11) 
a 


Under condition (2.11), we also have p; = (1+ i)! p; and 
Vn, uM (e)~ ue), 1=1,2,3,4. 


Example 4.6. Another example of interest (there are many on 
S@)), is built on adjacent transpositions. On S;, set vj = (i — 8 es 
Vi<j<i 9G,j+1)- Set 


lee) 
Vp = Yo pivi = >> b8G,540) pj = s (i —1)-1p; 
j=l i>jt+l1 


The spectral gap of v; is known to be of order 1/i? (see [19] and the 
references therein). This implies that there are k, C such that 


k k 
us (e) < Crp (e), vp (e) < Cups (6) 
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where p_, denotes the probability proportional to (1 + 1)~”p;. Even in 
the best-case scenario where p; < (1 + i)~!p;, we only get the bounds 


ugh (e) < Cvg(e), vg" (e) < Cup™ (e). 
However, on each S?, we also have ci 7Ew, <&,, < C&,. This gives 
os (e) < Cvh(e), vp? (e) < Cah”, (e). 
Now, if we assume that p; x (1+ i)~!p;, we have 
of, (€) < tp, (€)- 
In particular, if p; x pj x (1+ i)~!p;, we have 
ugk (e) < Cre), vO (e) < Cus, (e). 


Still, we are not able to describe precisely the behavior of vg” (e). In 
fact, we are not able to caput the transition between recurrence and 
transience for vp with pj = oa in terms of the parameter a. 


5. Around the Recurrence/Transience Dichotomy 


This section abstracts and extends the results obtained in Theorem 4.1. 
We consider G = |); Gy (increasing union of finite groups and study 


ug” (e) under the hypothesis that 


1 
k => 
&(|Gxl) 
where & is a regularly varying function of positive index. In addition, we 
assume that the sequence | Gz+1|/|G| is weakly increasing in the sense 
that 


on(p) = 


Veet |Gr+il < ol Gm (2.12) 
|Gz| IGml| 


We already know that we always have (see (2.7)) 


—NnoE 


x e€ 
ug” 
(e) = ) : 
7 1 Ge-11 


However, under the present hypotheses, we claim that 


—nog en ott 


+ 
|Ge_1| |Ge| 


tig” (2) ® for ne [&((Gel),E(Geri))]- (2-13) 


To prove this claim, set Ay, = e~”°* /|G,_1| and observe that 
Ak = e  MOK-oK+1) Gx ; 
Ag+ |Ge-1 
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For n, @ related as in (2.13), we have 


Ax |Ge| ( an 
< ex fork<@-1 
Apa (Cea ON. SGA 
and 
A 
Bs gol Gel fork>€+1. 


>e 
Ag+ |Ge_1| 


It follows that for n large enough (this forces £ to large as well), we have 


A, [ <1/2ifk<2 
Apw | =2if k > 2. 


This proves (2.13). 


Theorem 5.1. Assume that (2.12) is satisfied and that p is such that 
on(p) = 1/&(|Gzl) where &— is regularly varying of positive index a. 
Assume further that |Gy|/|Gp_1| = y(|Gel) where y is a slowly varying 
function. Then 


C =1 
Vn, ug” (e) < i ae 2s co 
&~*(n) 
Furthermore, for any fixed constants 0 < c < C < ow, we have 


Vn el (c6UGkD, CEUGED). HO) = ST 
k 


In particular, the walk associated with up is recurrent if a > 1 and 
transient ifa <1. 


Proof. For the upper bound, it is best to use (2.8) and to observe that, 
in the present case, the function F(s) in (2.8) is bounded above by 


yo& *A/s) 
&-1(1/s) 
It follows from (2.8) and [6, Theorem 1.7.1] that 


F(s) 


n yo& *(n) 


For n x &(|G2|), the estimate (2.13) gives 


1 1Gel | Cyoé"(n) 
IGel|Geal~ = én) 


ug” (e) = 
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Example 5.2. The Burnside group B(3, i) has order 33)+@)+G) wer? 
This gives the following behaviors of ug” (e). If pp ~ k-*-!, aw > 0, then 


ug” (e) ~ exp (—nvG+o) : 
If py ~ e7* then 
log(ug” (e)) ~ —Cog n)°. 


In order to apply Theorem 5.1, we note that 


|B3,r)| 


(BG.r— DI = y( BC, 7r))) 


where y is slowly varying and log y(t) x (log t)?/. If py = |B(3, k)|-8, 
B > 0, then Theorem 5.1 yields 


cpr 1/8 exp(— Cj (log n)?/3) < ug” (e) < Con V8 exp(—en(log n)*/*) 
as long as n x |B(3,k)|* for some k. 


Next, set ¢- = (1+ r)7! >°6 Sw; Where wo = e and wj, 1 <i <r, are 
the canonical generators of B(38,1r). We are interested in the behavior of 


bp => pidi, P= (DDE 
on B(3,0o). We note that 
bp = Yo vidi = ai (Xa +10) )bo 
r=0 ‘j>r 


Further, it is known that the spectral gap of ¢; on B(3, 72) is of order 1/1. 
See [19, 20]. Hence, setting pj = 0; ;(1+ i) "pi 


k k 
ug (e) < Cbs), bp” (€) < Cup™ (C). 
In particular, if p; x (1+ 7)~!p; then 
by CO) 2a, ©); 
This happens in particular if pj = A’, A> 1. 
6. Direct Sums of Finitely Generated Groups 


In this section, we consider the case when G = )°; Gj is a (countable) 
direct sum. Recall that the elements of G are simply the sequences g = 
(gi)iex such that g; = e; for all but finitely many 7 in J. 
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In this situation, for each 7, we fix a probability measure W; on G; and 
we fix a probability law p = (p;) on J. For each i, define a probability 
density @; on G by setting 


oily) = 


Wily) if y= (jer with y; = e; for alli Aj 
0 otherwise. 


We set dp = >) vidi. Note that we use the same notation as in the 
previous section with a somewhat different meaning. The semigroup 
e 'U-Rep) ig given by convolution with measures that we call hp . 
Obviously, 


[o,@) tn 
Pe. got (n) 
hy =e a nee : 
0 
By Proposition 2.1, we know that 


py” (e) X hP(e). (2.14) 


In the present setting, this is a very useful fact as we will show that the 
direct sum structure makes the behavior of he (e) easily accessible. 

On Gj, we have eU~ Rua) f = fxhj with hig = e~* WOE" /n! yw. 
This can be used to express e~ Hi Ro,) ¢ as follows. For each i, 7 € G 
and y € Gj set Ly = (%)jer where z; = 2; if 7 Ai and a = y. For 
any finitely supported function f on G, defines ff : Gj tH R by setting 
fi) = f (@t). Then, 


e UR4) F(x) = eH Bes) f(a). 


Further, viewing all Rg,’s as commuting convolution operators on G, 
we have 


I— Rg, = >) ri(I — Reg,) 
I 


and thus 
he = ieee). C= Ge: 
if: 


We need the following simple lemma. 


Lemma 6.1. On a finite or countable group Z, let w be a prob- 
ability measure such that w'™(e) < n < 1 for all n. Then hy = 


e FP" /n!)v™ satisfies 
et < hye) se 8, te 0,1) 


where Cc, € (0,1) is the unique solution to sel = pn. 
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Proof. The lower bound is obvious. For the upper bound, we write 
he(e) < e-*(1 + n(et — 1)). Set v(t) = 14+ n(e’ — 1). Then v(0) = 1 
v(t) = ne’ < c,e%! on (0,1). Hence v(t) < e!. This gives hi(e) < 
eGA-a)t ag desired. 


Proposition 6.2. Let G = )0,;G;. Fix n € (0,1). Let Wj be a 
probability density on G; such that for alli € I and n, vi” (ei) <n. 
Assume further that log hi,t(e;) & —H(t) for t > 1. Then 


loghP(e)~—t D> pe- >) Hector). 


k:tpp<1 k:ppt=1 


Proof. Write 


log hf (e) = }° log hi, tp; (ei) 


I 
= » log hi,tp; (ei) + » log hi, tp; (ei)- 
tpi<1 i:tpj>1 


To obtain the desired result, use Lemma 6.1 in the first sum and the 
assumption in the second sum. 


6.1. Examples on Direct Sums 


Example 6.3. Consider the case when G = )0; Gi with G; = Yas 

wi = Wo with wo symmetric with finite generating support on Z* so 

that hj,4(0) ~ t~4/2 for t > 1. Hence we can take H(t) = logt, ¢> 1. 
Case (a): Set pj = 2~*. We compute 


eS ee ee, 
k:ppt<1 k>logy t 


To compute Paes log(tp;,), we fix an integer € and consider all the 
integers & such that 


£ < log(tpy) < €+1, 
that is, 
£—logt < —klog2 < €4+1-logt. 


Hence, 


y log(tp,) & oa £~ (log t)?. 


pptzl é<logt 
Applying Proposition 6.2, we obtain 


log hp ¢(e) ~ —Clog t)?. 
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Case (b): Set pj = coi? with o > 1. Then 


1/o 
C C 
) Pk = Co y ko ~ = 1 (Co ) aia = 7 _ gre 
k:tpp<1 k>(cg t)1/o a oe 


To compute }> k:ippt>1 log(tp,), fix an integer @ and consider the values 
of & such that log tp, € [€,€+ 1). We must have 


£—logcegt < —o logk < €—logcgt +1, 


that is, 


(cg t)/% et Die <k < (etek: 
Hence, by Proposition 6.2, 
> log(tpz) & ye pie eT he pie 


k:ppt=1 é<logt 
This gives 
log hy ¢(e) aye. 


We note that these results apply in fact not only when Go = Z® but 
also when G is a group with polynomial volume growth. This includes 
all finitely generated nilpotent groups. 


Example 6.4. Consider the case when G; = Go with Go being a 

polycyclic group with exponential volume growth. Let W; = Wo with wo 

being a symmetric probability density with finite generating support on 

Go. In this case, we have H(s) ~ s'/3. See, e.g., [21, Sect. VII.5] and [1]). 
Case (a): Set pj = 2-*. Then )p.ty, <1 Pk X 1/t and 


1/3 
yee |) ee ae ea 


pet=l k<logs t 
Hence 
log hp ¢(€) ~ =7'/8, 


This result generalizes immediately to the case where Go and wo are 
such that H(s) ~ s® for some fixed a € (0, 1] (see, e.g., [15] for such 
examples). In this case, log hp ¢(e) ~ —t®. 

Case (b): Set pj = cgi~® with o > 1. Then D0 y-45, <1 Pk = tl+1/o as 
in Example 6.3(b). Further 


tl/3 ifo >3 
YA ee Sy ee Be loge: itor s3 
pete k<tlle tl/e ifo <3 
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By Proposition 6.2, it follows that 


aye ifo >3 
loghps(e)~ —4 t'/Slogt ifo =3 
ge ifo <3. 


This result generalizes immediately to the case where Go and wo are 
such that H(s) ~ s® for some fixed a € (0,1] (see, e.g., [15] for such 
examples). In this case, 


to ifo > 1/a 
loghpt(e)~¥ —4 t*logt ifo=1/a 
ae ifo <1/a. 


Our next two examples treat examples where G = )°,; G; and the 
G;’s are finite groups. 


Example 6.5. Consider the case when G = )°; Gj with Gj = Go, 
a finite group, and ¥; = Wo with Wo symmetric with finite generating 
support on Go. Then we can take H(t) = 1, t > 1. 

Case (a): Set pj = 2~*. We have 


Y> pe =A/tand > 1~ log(t). 
k:ppt<1 prtel 
Applying Proposition 6.2, we obtain 
log hp ¢(e) ~ — log t. 


In this case, a more precise estimate is desirable. 

Case (b): Set pj = cot? with o > 1. Then )oj.i5, 21 Pk 

1/o 
a a Burther- Ds gps be t!/° as well. Hence, by Proposition 
6.2, 

log h? (e) Bo ape 

In case (b) the behavior obtained when Go is finite is the same as 

when Go has polynomial volume growth. 


Example 6.6. Consider the case when G; = Z/mj,Z is a cyclic finite 
group. On each of these finite group we let w; be the probability density 
that is uniform on {—1,+1}. In this case, we observe that 


t if t € (0,1] 
log hi,e(0)~—} log(l+t) if t € (l,m?) 
log my ift> m?. 


Sée, €:2.5-[19].- 
Set p; = 2~*. We will consider three cases depending on the behavior 
of the sequence m;. We assume that m; ~ m(7z) where m is an increasing 
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function and log om is regularly varying at infinity, of index a € [0, 00). 


In all cases, 
So pee ye 
k:ppt<1 


Case (a): Assume that a € [0,1] and that limsup;_,,, t7! logom(t) 
< oo. Then, for any t > 1 


> log mp ~ log(t) log{m(og £)}. 


kippt=mz 
Further, 
Ss log(tpz) < C ye log m(k) < C log(t) log[m(log t)]. 
k:ppte[1,m2) ke[1,log t) 


Hence, we obtain 
log h? (e) ~ — log(t) log[m (log #)]. 
Case (b): Assume that a € [1, 00) and that lim inf;_, 5 t7! log om(t) > 0. 
Then 
> log mp < C log(t)?. 
k:ppt>m? 
Further, using the computation form Example 6.3(a), we have 
Y > log(tpp) © log(t)?. 
k:ppte[1,m?) 
Hence, we obtain 
log h? (e) i= log(t)?. 


6.2. Comparison Inequalities 


In this last subsection, we give a simple result that allows us to extend 
the estimates described above to measures that are not exactly adapted 
to the direct sum structure of the underlying group. 

We continue to assume that G = )°, Gj as earlier in this section and 
consider 


Pp = S- vidis 


where each ¢; is a symmetric probability measure on G; with associated 
Dirichlet form €%, and set €4, = €;. 
Assume further that each ¢; satisfies the pseudo-Poincaré inequality 


Vy¥e Gi, >) fay) -—f@P? < Pipi (WE. /)- (2.15) 


rEeG; 
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Example 6.7. If for each 7 there is a finite symmetric generating set 
S; of G; such that ¢; => «; > 0 on S$; then (2.15) holds true with p;(y) 


being the word-length distance between e; and y in G; based on the 
generating set S$; and P; = et 


Given y € Gj, let y’ € G= >>7 Gi be the element yy? = (yj) with 
yy = ej if 7 Ai and y; = y. Given a function f ¢ £2(G), G= >>; Gi, set 
&(f. f= 0/2) SY) ify) —f@)?oity). 

reG,yeGi 
Lemma 6.8. Assume (2.15). For any y = (yi) € G, set 
2 1/2 
pp(y) = (x Fon?) 
ra 
Then we have 


Vy eG, D> If(cy) —f@/ < ppy)Eo (Ff). (2.16) 
zeG 


Note that pp(y) is finite for any y = (y;) because y; = e for all but 
finitely many 7. 


Proof. To simplify notation, we write p = pp. Given y = (y;), let y* 
be the element in G with coordinates y;" given by a = yj if7 <7 and 
ye = e; if7 > i. Then 


fan=f@= > feria”) 


a 


(note that only finitely many terms are nonzero in this sum) and, by 
Minkowski inequality, 


PCy —fOll2 =I 5 fey) = FOyO Pile < Yo IF Gy") = FO. 
Next, using (2.15), we have 
So fcy') —fOlla so PPE? 


= Pi/po" ex iE, Py? 


1/2 
< p(y)? (x Pes.) 


=e) apy 


Proposition 2.2 and 6.3 together yield the following statement. 
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Proposition 6.9. Referring to the setting and notation introduced 
above, let v be a symmetric probability measure on G = )°; Gi such 
that v(p5) < oo. Then there are C',k such that 


Me) <C (ve) + eu) : 
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Abstract 

In a graph, a set of vertices that is stabilized setwise by only the trivial 
automorphism is called a distinguishing class. Not every graph has such a 
set, but if it does, we call its minimum size the distinguishing cost. Many 
families of graphs have such sets, and for some families the distinguishing 
costs are surprisingly small. 

The chapter begins with a survey of results about the distinguishing 
cost for finite and infinite graphs. Then it focuses on infinite graphs 
with finite cost and presents a new upper bound on the distinguishing 
cost of graphs with linear growth, two ends, and infinite automorphism 
group. It ends with the problem of the existence of one-ended, connected, 
locally finite infinite graphs of linear growth and countably infinite 
automorphism group. 


Key words: Distinguishing number, automorphism, determining set, 
determining number, infinite graph 


AMS subject classification (2000): 05C15, 05C25. 


Contents 


Lama ea 


ol 


Introduction 
Preliminaries 
Finite Graphs 
Infinite Graphs 
Outlook 


Acknowledgment 
References 
Added in Proof 


1. 


A significant number of recent papers investigate vertex colorings that 
are only preserved by the identity automorphism. Such colorings are 
called distinguishing. If two colors suffice, the coloring is said to be two- 
distinguishing, and the graph is said to be two-distinguishable. Given a 
two-distinguishing coloring, the set of vertices of either of the two colors 
is stabilized setwise only by the trivial automorphism. Such a set is called 
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a distinguishing class, and the minimum size of a distinguishing class of 
a graph G is called the distinguishing cost; we denote it by p(G). 

The concept of a distinguishing class is different from that of a 
determining set. A determining set is a set of vertices whose pointwise 
stabilizer is the identity. Such a set is also called a base for the action of 
the automorphism group. A determining set need not be a distinguishing 
class, but every distinguishing class is a determining set. The minimum 
size of a determining set for G is called the determining number, Det(G). 
Clearly Det(G) < p(G). 

By [10], an infinite, connected, locally finite graph G with infinite 
automorphism group has finite distinguishing cost (and thus finite 
determining number) if and only if the automorphism group is countable. 
In such a case, p(G) cannot be larger than 3 - Det(G). However, for 
finite graphs p(G) and Det(G) can be very far apart. We show this by 
constructing a family of finite graphs in which the distinguishing cost is 
a large multiple of the determining number. 

Such constructions can easily be extended to finite graphs of any size 
and infinite graphs. Because, given a nontrivial graph G and a path P 
of arbitrary length or a one-sided infinite path R, then the graph H, 
obtained from G and P, respectively R, by joining one end vertex of P, 
respectively the end vertex of R, to all vertices of G, has the same cost 
and determining number as G. This is so because every automorphism 
of H fixes all vertices not in G, and every automorphism of G extends 
to one of H. 

Our main result is a new bound on the distinguishing cost of infinite, 
connected, 2-ended, locally finite graphs with infinite automorphism 
group and linear growth. 

For infinite graphs the proofs rely on facts about structure, ends, and 
automorphisms of infinite, locally finite graphs given by Halin [14], and 
more recently in [16]. For finite graphs, Cartesian products are the main 
tool. 

To gain perspective, we also present a short survey of results about the 
distinguishing cost of finite and infinite graphs, the relationship between 
p(G) and Det(G), and a partial list of classes of graphs that are two- 
distinguishable. 


2. Preliminaries 


Distinguishing two-colorings are a special case of distinguishing d- 
colorings. By a coloring of the vertices of a graph G with d colors we 
mean a labeling with the integers 1,..., d. If no nontrivial automorphism 
preserves the colors we call this a d-distinguishing coloring. A graph with 
such a coloring is called d-distinguishable. This concept was introduced 
by Albertson and Collins in [3] and has spawned a wealth of results for 
both finite and infinite graphs. 
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In many families of graphs, all but a small finite number of mem- 
bers are two-distinguishable. For example hypercubes Q, are two- 
distinguishable for n > 4, Cartesian powers Kz’ for n > 3, and G” for 
connected graphs G 4 Ko, K3, where n > 2 [1, 9, 18, 19]. Other examples 
are Kneser graphs KG’, ;, for n > 6, k > 2 [2], and three-connected planar 
graphs (with seven small exceptions) [17]. 

Families of infinite two-distinguishable graphs include homogeneous 
trees of arbitrary degree [4], the hypercube of arbitrary transfinite dimen- 
sion [15], denumerable, vertex-transitive graphs of connectivity one [21], 
the denumerable random graph [15], and connected, locally finite graphs 
of polynomial growth with the property that every automorphisms moves 
infinitely many vertices [20]. 

As noted previously, the color classes in a two-distinguishable coloring 
of a graph G are called distinguishing classes, and the minimum size 
of a distinguishing class is the cost of two-distinguishing G, or the 
distinguishing cost of G, and is denoted by p(G). This concept was 
introduced and studied by Boutin in [5], motivated by the observation 
that the size of the smaller color class in a two-distinguishing coloring 
of a graph can be extremely small. For example, she showed that the 
hypercube Qs of dimension 2* can be two-distinguished by coloring 
k+2 vertices black’ and the others white [7]. Moreover, there are infinite 
graphs with infinite automorphism group and finite distinguishing cost. 
In particular, if G is a connected, locally finite graph with infinite 
automorphism group, then o(G) is finite precisely when Aut(G) is 
countable [10]. 

In both the finite and the infinite case, there is a connection between 
the minimum size of a determining set and that of a distinguishing 
class. Recall that a determining set for a graph G is a set whose point 
stabilizer in Aut(G) is trivial, that the determining number Det(G) is 
the minimum size of a determining set of G, and that Det(G) < (G). It 
is possible that Det(G) = e(G), but we show that in finite graphs p(G) 
can be an arbitrarily large multiple of Det(G). 

We then provide a new upper bound for o(G) when G is a connected, 
locally finite graph with linear growth, infinite automorphism group and 
two ends. This is our main result. Of course, it also implies an upper 
bound on Det(G). 


3. Finite Graphs 


Most of the results in this section pertain to Cartesian products and 
Cartesian powers of graphs. Many proofs invoke the structure of the 
automorphism group of Cartesian products. The kth Cartesian power of 


TIna two-coloring we assign black to the smaller class and white to the other, unless 
both classes have the same size. 
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a graph G, denoted eas is the Cartesian product of G with itself & times. 
A good reference for both products and powers is [13]. In particular, 
automorphisms of Cartesian products are discussed beginning on page 
128. Recall that a nontrivial graph G is prime with respect to the 
Cartesian product if it cannot be written as the Cartesian product of 
two smaller graphs. In particular, if G and 4H are relatively prime, that 
is, if they have no common nontrivial factor, then the automorphism 
group of GU4. is the direct product of the automorphism groups of G 
and H. To be more precise, in this case, gy € Aut(GU #) if and only if 
there is a € Aut(G) and B € Aut(#) so that for all (g,h) e V(GO A), 
(9, h) = (a(g), B(h)). When useful, we write g = Yo,g. 

We begin with the hypercube Q;, which is the kth Cartesian power 
of the complete graph K2. In 2004, Bogstad and Cowan [9] showed that 
for k > 4, Q, is two-distinguishable. The two-distinguishing coloring 
produced by their method gave a distinguishing class of size k+2. Imrich 
asked how small a distinguishing class could be for hypercubes. In the 
first paper to address the question of distinguishing cost, Boutin found 
upper and lower bounds within a factor of 2 of each other. The theorem 
is stated below. 


Theorem 3.1. [5] For k => 5, floggk] +1 < p(Qz) < 2flogs k] — 1. 


Later, precise values for the distinguishing cost of certain hypercubes 
and certain Kneser graphs were obtained. 


Theorem 3.2. [7] For m = 3, p(Qam) — p(Qam_1) = p(Qam_2) = 
m+ 2. 


Theorem 3.3. [7] p(KGgm_j 9m-1_1) =m+1. 


Before further examining graphs with a small distinguishing cost, it is 
natural to consider how large the distinguishing cost might be. Clearly, 
in a two-distinguishing coloring, one of the color classes must be at most 
half the size of the vertex set. This bound can be achieved as the following 
theorem shows. 


Theorem 3.4. [12] Let G be a graph with trivial automorphism group 
and H the lexicographic product of G and K. Then p(H) = | V(1)|. 


Proof. First observe that H is obtained from G by replacing every 
vertex uv by an edge vj v2, and by replacing every edge uv by the edges 
ujvj, where 7,7 € {1,2}. Clearly the transposition of any pair {v1, v2}, 
where all other vertices are fixed, is an automorphism. To break it we 
have to assign different colors to v1 and v2, say black and white. Doing 
this for all pairs we see that H consists of a copy of G that has only black 
vertices and a copy that has only white vertices. Clearly the two copies 
of G cannot be interchanged. The observation that G is asymmetric 
completes the proof. 
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We wish to remark that for each two-distinguishing coloring of H 
above, though there is no nontrivial automorphism that preserves the 
color classes, there is one that switches them. 

At the end of this section we will consider the graphs H = KomOG, 
where G is an asymmetric graph of order m. For graphs in this family 
we also have p(H) = 3|V(H)|. 

The fact that Q, is Kg was instrumental in proving Theorem 3.1. For 


the cost of Ke a detour via the determining number of Cartesian powers 
of graphs was taken. For such powers we have the following theorem 
regarding determining numbers. 


Theorem 3.5. [6] Let G be a connected graph of order n that is prime 
with respect to the Cartesian product. Then 


max {Det(G), [ (og, k + log, |Aut(G)|)]} < Det(G*) 
< Det(G) + [log, k]. 


Note that since G is fixed, Det(G), |V(G)|, and |Aut(G)| are constant. 
Thus, both upper and lower bounds on Det(G*) grow logarithmically 
with k. 

For graphs on at least three vertices meeting other mild hypotheses, 
the connection with distinguishing cost is provided by the next result. 
This will imply that both upper and lower bounds on p(G*) also grow 
logarithmically with k. 


Theorem 3.6. [8] If G" is a two-distinguishable Cartesian power of a 
prime, connected graph G on at least three vertices with Det(G) < k and 
max{2, Det(G)} < Det(G"), then Det(G") < p(G*) < Det(G*) +1. 


Using the known value of Det (KS ) from [6], the following theorem was 
an immediate consequence. 
Theorem 3.7. [8] For k>3, [log3(2k+1)] + 1<p(Kf)< 
[logs (2k + 1)| + 2. 


We continue with a new result about finite graphs with large distin- 
guishing cost. 


Theorem 3.8. If H is an asymmetric graph on m vertices, then 
p(Kom OH) = m2™!, 


Proof. Let V(A) = {hi,...,hm}. Recall that all complete graphs are 
prime with respect to the Cartesian product. Because Kgm has more 
vertices than H, it cannot be a factor of it. Hence Kom and H are 
relatively prime. Using the notation from the beginning of this section, 
given that H is asymmetric, all automorphisms of Kom OH are of the 
form Yo,id = Ya. That is, yy € Aut(Kam 0H) permutes the sets H* = 
{(z, hi) | hi € H},z € V(Kom), taking H” = {(2,hi) | hi € H} to 
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H®®) = {(a(x), hj) | hy € H}. The sets H* = {(z,h,) | hj € H} are called 
H-fibers. Notice that yy, when applied to a vertex, only changes the first 
coordinate. 

There are 2” distinct two-colorings of H. Thus, we may assign distinct 
two-colorings to each of the 2” H-fibers of Kom OH. Any nontrivial 
automorphism of Kam 0 H (which is necessarily induced by a nontrivial 
automorphism of Kam) must nontrivially permute H-fibers. Because 
each H-fiber has a distinct coloring, no such nontrivial automorphism 
preserves the coloring. Thus, the coloring is distinguishing. 

Note that the coloring produces two equal-size color classes of size 
m2™-1, because 7729 ()i = m2™—1. Thus, p(Kam OH) < m2™—1. 

Now, consider a two-coloring ¢ of Kom OH with fewer than m2™-! 
black vertices. Because having distinct two-colorings of H-fibers implies 
m2™—1 black vertices, not all two-colorings of H-fibers under c can be 
distinct. Thus, there is some z # y € V(Kam) so that H” and HY have 
the same two-coloring. Let a = (x y) be the transposition of x and y 
in Kom. Then yy simply switches H® and HY in KomO H, fixing all 
other vertices. This tells us that ya is a nontrivial automorphism that 
preserves the coloring and thus the coloring is not distinguishing. We 
conclude that p(Kom OH) = m2™-!, 


Clearly, p(Kom 0 A) = $I V (Kom 0 A). Furthermore, there is again 
an automorphism that interchanges the color classes. 

It is easy to see that Det(Kom) = 2™ — 1. Hence p(Kom) = m2™-! = 
e(2™ —1)+ F > F-Det(Kom). We thus have the following corollary. 


Corollary 3.9. To any given integer n there are infinitely many graphs 
G such that p(G) > n- Det(G). 


Proof. The assertion of the corollary is true for any graph Ko~QO 4H, 
where H is an asymmetric graph on m vertices and 4 > n. 


4. Infinite Graphs 


We already mentioned that there are many families of infinite graphs that 
are two-distinguishable. In this section we survey the results previously 
known about the distinguishing cost of some of these graph families. 
In particular, we focus on the class of infinite, connected, locally finite 
graphs, denoted I’. 

An infinite graph is called locally finite if all its degrees are finite. For 
such graphs we have the following fundamental result. 


Theorem 4.1. [10] Let G eT with infinite automorphism group. Then 
p(G) is finite if and only if Aut(G) is countable. 


Note that the above theorem implies that, within [, a countably 
infinite automorphism group also implies a finite determining number. 
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Along the way to proving Theorem 4.1, the authors prove the 
following. 


Theorem 4.2. [10] Let G eT have countably infinite automorphism 
group. Let Det(G) = n. Then 


p(G) < =| BGs 


where b(n) denotes the number of 1s in the base-2 representation of n. 


Another result pertains to graphs of linear growth. A graph has linear 
growth if the number of vertices of distance at most n from a given 
vertex grows linearly with n. To be more precise, recall that the set 
of vertices u € V(G) for which d(u,v) < n is called ball of radius n 
centered at v, and denoted B,(n). For later reference we also define the 
sphere Sy(n)of radius n centered at v as the set of vertices u € V(G) for 
which d(u,v) = n. 

A connected, locally finite, infinite graph G is said to have linear 
growth if there exists a vertex v and a constant, real number c, such 
that |By,(n)| < cn, for all n € N. The definition is independent of the 
choice of v, but c may have to be replaced by a different constant if v 
is changed. We say that G has linear growth with growth constant c. 
Notice that c > 1, because By(n) has at least n + 1 vertices. 

In [10] the authors studied the distinguishing cost for graphs in T with 
linear growth, with the following result: 


Theorem 4.3. [10] Let GET with countably infinite automorphism 
group and linear growth with growth constant c. Then p(G) < max(3, 
c+ 1). 


The bound is sharp for c = 2. To see this, consider the two-sided 
infinite path Py,. It has growth constant 2, and we know that it can 
be distinguished by just three black vertices, but not by two. Hence 
p(Px,) = 3, which is the bound given by the theorem. However, in the 
next section we improve the bound for c > 3. 


4.1. A New Bound for Linear Growth 


One of the main ingredients of the original proof of Theorem 4.3 is the 
fact, see [16, Corollary 3.8], that every graph G € IT with countably 
infinite automorphism group has a finite determining set, and that 
every nontrivial automorphism of G moves infinitely many vertices. A 
subgroup of Aut(G) is said to have infinite motion if each of its nontrivial 
automorphisms moves infinitely many vertices of G. The graph G is said 
to have infinite motion if Aut(G) does. 

We use the notation Aut(G),y for the stabilizer of vertex v, that is, for 
the subgroup of Aut(G) that consists of all automorphisms that fix v. 
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It will be useful to know that by [10, Lemma 2.3], in a graph in T with 
a finite determining set, vertex stabilizers are finite and have infinite 
motion. 

The proof of Theorem 4.3 constructs, for a given, fixed v, a two- 
coloring that is not preserved by any nontrivial automorphism in 
Aut(G)y. To improve on the conclusion of Theorem 4.3, we use a similar 
procedure, along with the conclusion of Proposition 4.4 below. Later we 
will make use of this proposition in our investigation of graphs with two 
ends. 


Proposition 4.4. Let v be a vertex of a graph G € T such that 
Aut(G)y is a finite group with infinite motion. If G has linear growth 
with growth constant c, then 


p(G) < max(3, c). 


Further, the distinguishing coloring with cost p(G) can be chosen such 
that v is colored black. 


Proof. For c < 3 the arguments in the proof of Theorem 4.3 from [10] 
yield the bound p(G) < 3. 

Hence, we can assume that c > 3. Fix a vertex v so that |By(n)| < cn. 
We need the following facts from the proof of Theorem 4.3 in [10]. As 
the proofs are relatively short, we include them for the benefit of the 
reader and the sake of completeness. 

Fact 1 Infinitely many spheres with center v have size at most c. 

For, if only finitely many have size at most c, then there is an index ng 
such that |Sy,(n)| > c for n > no. If |By(no)| = cno, then |By(n)| > cn 
for n > no. Hence cnp + 1 — |By(no)| is a positive integer, we denote it 
by d. Clearly 


|By(no + 4)| = |By(n0)|+ d(e+ DI] = (eng +1—d+(e+ld) > c(no+ 4), 


which violates the fact that the growth constant is c. 

Fact 2 There is a natural number ), b < c, such that infinitely many 
spheres with center v have size 0. 

This follows immediately from Fact 1. Notice that 6 4 1; otherwise G 
could not have infinite motion. 

Fact 3 There is a sequence of spheres R; = Sy(nj;), for ny < ng <--- 
of size b, such that, for 7 < i, the distance between arbitrary vertices in 
R; and R; is larger than the maximal distance between any two vertices 
in U;<;R;, for each 7 > 1. 

Fact 4 We can assume without loss of generality that every nontrivial 
element of Aut(G)y acts nontrivially on each of the R;. 

Aut(G), has infinite motion by [10, Lemma 7]. Hence, if a € Aut(G), 
fixes S,(n), then it fixes B,(n). Thus, again by infinite motion, such an 
a acts nontrivially on all but finitely many of the R;. Without loss of 
generality, we may assume that Aut(G),-—{id} acts nontrivially on all R;. 
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Fact 5 There is a bijection ¢; : Ry > R; so that for eacha € Aut(G)y, 
alk = $;'(@|Ri)gi. 

For each 7 > 1, fix a bijection ¢; from R, to R;. Let ¢; = id on Ry 
and denote the group ; | (Aut(G)y| Rigi by A;. The A; are subgroups 
of Sym(R1). Furthermore, we chose an a € Aut(G), and consider the 
infinite set of permutations {6; = ; (al Rigi} Cc Sym(R1). As Sym(R1) 
has only finitely many elements, infinitely many of the 6; must identical. 
Let J be the set of indices for which this is the case. Notice, this set 
need not include 1. Let jo be the smallest of these indices. Clearly, for 
jeJall 


Vj = Pind; (OLR )j9;," 
are identical on Rj, and 
-1 =4 
Vio = Pio Pin Rin Ibn Pj, = Hl Rio - 


Notice that the $j%), are bijections from Rj, to R;. Hence, we can 
assume without loss of generality that J is N; in other words, that all 
fj are identical and that 6; = a|R1. 

Because Aut(G),y is finite, proceeding successively, we can assume 
that the assertion of Fact 5 holds. 

Having established these facts, we now suppose a € Aut(G), fixes 
a vertex @j(u) in Rj. Then a(u) = ¢;adi(u) = 6; (a@(pi(u))) = 
; 'di(u) =u. That is, ifa € Aut(G)y fixes ¢;(u) in R;, then a fixes u 
in Ry. 

Let v1, v2,...,vp be the vertices of Rj, and X = {v, $1(v1), $2(v2),..., 
bp—1(vp_—1)}. Color the vertices of X black, and all other vertices in the 
graph white. Suppose a € Aut(G) preserves color classes. 

Suppose first that b > 2. By our choice of distances between the 
spheres R1,..., Ry_1, the vertex v is uniquely identified by its distances 
to the remaining (at least two) vertices of X. Because a preserves 
X, it must fix v. And because a must also preserve distance to v, 
a cannot permute the remaining vertices of X. Thus, @ fixes each of 
1(v1),---,@b-1(vp_1). By our earlier argument, a therefore fixes each 
of vj,...,Vp—1. Given that a € Aut(G), must preserve R1, a sphere 
centered at v with 6 vertices, a also fixes the remaining vertex vy of Ry. 
But the spheres were chosen so that every nontrivial automorphism in 
Aut(G),y acts nontrivially on each sphere, and in particular on R,. Thus, 
@ itself is trivial. And X is a distinguishing class of size b < c for G. 

If b = 2, then we have colored only two vertices black, namely v and 
v,. Hence, to make sure that they cannot be interchanged we choose a 
third vertex x such that the distances between any two vertices from 
the set {v, v1, z} are different, and color it black. 


An immediate consequence is the following theorem, which improves 
Theorem 4.3 for c > 3. 
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Theorem 4.5. Let GeT with countably infinite automorphism group 
and linear growth with growth constant c, then 


p(G) < max(3, c). 


Proof. Because G € I has countably infinite automorphism group, it 
has a finite determining set [14], and thus finite vertex stabilizers [10, 
Lemma 2.3]. Furthermore, because G also has linear growth, Aut(G) 
and Aut(G), have infinite motion [10, Lemma 4.2]. Application of 
Proposition 4.4 completes the proof. 


4.2. Graphs with Two Ends 


In this subsection we prove auxiliary results that we will use, together 
with Proposition 4.4, for the proof of our main theorem. 

Good references for the ends of infinite graphs are [11, 14]. However, 
the following basic definitions and properties should be sufficient. To 
define ends of a graph we first observe that every connected, infinite, 
locally finite graph must have a one-sided infinite path. Such paths are 
called rays. A subgraph of a ray that is itself a ray is called a tail. 
Two rays are equivalent if, for every finite set F C V(G), both have 
a tail in the same component of G — F. This relation is indeed an 
equivalence relation, and the resulting equivalence classes are called ends. 
For example, a finite graph has no ends, the two-sided infinite path Px, 
has two ends, Px, UO Px, has one, and the infinite homogeneous tree Tg 
of degree d > 2 has infinitely many ends. 

To show how ends relate to subgraphs of G, let us consider an end e 
and a finite F C V(G). Take a ray R € e. Because F is finite, R can 
meet F' only finitely often. Hence, all but finitely many vertices of R 
must be in the same component, say C’, of G— F. In other words, a tail 
of R is in the infinite component C’. It is easy to see that any other ray 
in e also has a tail in C. 

Note that each infinite connected component of G — F contains a ray 
and therefore an end. Thus, if G has only two different ends, say e and f, 
they correspond to the same infinite component C' of G — F only when 
C is the sole infinite component of G — F. Otherwise, that is, when 
G — F has exactly two infinite components, say C1, and Cy, one of the 
ends corresponds to Cj and the other to C. 


Lemma 4.6. Let G é€T and |Aut(G)| = &o. Then the orbits of all 
vertices of G are infinite. 


Proof. By [14], a connected, locally finite graph, with countably infinite 
automorphism group has a finite determining set, say D. Suppose no 
vertex of D has infinite orbit. Then the orbit O(D) of D is finite. Because 
Aut(G) is infinite, and O(D) is an orbit that is finite, there must be two 
distinct automorphisms, say a and £, that agree in their action on O(D). 
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Thus, their actions also agree on the determining set D. By definition of 
determining set, this means a = 6, contrary to assumption. 

So there is at least one vertex of D, say v, with an infinite orbit. 
Because G is locally finite, every ball of finite radius around v contains 
only finitely many vertices. Thus, since the orbit of v is infinite, to every 
positive integer d there is an automorphism @ such that d(v,a(v)) > d. 

Let w be any vertex of G distinct from v, and let m = d(v, w). To 
prove that w has an infinite orbit, we show that there are automorphisms 
a;,7 € N, such that the vertices a;(w),i € N are pairwise different. 
We define the a; by induction, beginning with a1 = id. Suppose 
Q1,02,...,@% are already defined and the vertices in Wy = {aj;(w) : 
1 <i < k} are distinct. We know there is an automorphism a@ such that 


d(v,a(v)) > max{d(v,aj(w)):1<i<k}+m. 


Then a(w) ¢ Wy. Hence, setting a,4; = a, the elements in Wz41 are 
clearly pairwise distinct. 


The following immediate result generalizes the preceding work to any 
infinite subset of Aut(G). 


Corollary 4.7. Let GeéT have the property that |Aut(G)| = Xo. If 
A is any infinite set of automorphisms of G, then all vertices of G have 
infinite A orbits. 


Lemma 4.8. Let G eT be 2-ended with countably infinite automor- 
phism group and linear growth. Then, for every finite set of vertices F, 
every nontrivial automorphism a of G moves infinitely many vertices in 
each infinite component of G— F. 


Proof. In the following, we show that, if there exists a nontrivial 
automorphism of G that moves only finitely many vertices of some end of 
G, then to any finite set of vertices F' there exists a nontrivial a € Aut(G) 
such that a(F) = F’. By [14, Theorem 6], this would mean Aut(G) is 
uncountable, a contradiction to our hypothesis. 

Because G has two ends, there is a finite separating set S of G such 
that G — S has precisely two infinite components C;, and C2. We wish 
to show first that there are infinitely many automorphisms that do not 
switch ends. If only finitely many automorphisms preserve ends, then 
there are infinitely many automorphisms y,y1,y2,... that do switch 
ends. Consequently, the infinitely many automorphisms y y1, y y2,... do 
not switch ends. Further, using the methods of the proof of Lemma 4.6, 
we conclude that there are infinitely many automorphisms that do not 
switch ends and also move any finite set arbitrarily far. 

Suppose F is a finite set of vertices and infinitely many automorphisms 
move F into Cy — By(m) for an arbitrary, but fixed B,(m). Then their 
inverses (again, infinitely many) move F into Cj — By(m). 
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By [16], given that G € IT has infinite automorphism group, it has 
infinite motion. That is, every nontrivial automorphism of G moves an 
infinite number of vertices. Thus, each must move an infinite number of 
vertices in at least one of Cy and Co. 

Assume now that a nontrivial a € Aut(G) moves only finitely many 
vertices of C,. Let v be a vertex in the finite set S that separates C} 
from C2, and let m be the maximum distance from v of any of the 
finitely many of vertices in C, that are moved by a. Then a fixes every 
vertex in Cy, — By(m). Now, let F be any finite set of vertices of G. 
By previous remarks, since all vertices have infinite orbit, and there 
are automorphisms that do not switch ends but also move a finite set 
arbitrarily far, there is 8 € Aut(G) such that B(F’) C Ci, — By(m). Then 
B~ ‘a is a nontrivial automorphism that fixes F pointwise. 


4.3. The Main Result 
We now have all prerequisites for the proof of the main result. 


Theorem 4.9. Let GET be a 2-ended graph with countably infinite 
automorphism group and linear growth with growth constant c. Then 


e(G) < max (3, =) . 


Proof. Because G has linear growth with growth rate c, there exists 
v € V(G) so that |By(n)| < cn for all n € N. Because G has two 
ends, there is an m € N so that for all m > N, G— By(m) has exactly 
two infinite components. Let C, and C> be the infinite components of 
G — By(m), and Rj and R2 the boundary points of B,(m-+ 1) that are 
in C1, respectively in Co. By choice of m, Ry and Ro are disjoint and 
their union is S,(m-+ 1). Note that the growth rate of one of C1, C2 is 
at most 5; otherwise, the growth rate of G would be larger than c. We 
choose the notation such that C, has growth constant at most 5. 

We now observe that C1, together with uv and all shortest paths 
between v and Cj, is a graph with growth constant at most 5. We 
denote it C. (C is the convex hull of Cy, U {v}.) Clearly Aut(C), 
contains the restriction of Aut(G), to C. Given that we are only 
concerned with automorphisms arising from Aut(G), we will restrict our 
consideration to these automorphisms; denote this subgroup of Aut(C)y 
by H. Because we may consider H < Aut(G)y, H has infinite motion, 
and by Proposition 4.4, we infer that C has a distinguishing coloring 
of cost max(3, 5) under the action of H and that v can be chosen as a 
member of the minimum size distinguishing class. 

By Proposition 4.4 we can choose a vertex set S such that v € S; all 
other vertices of S are in C1, and S is a distinguishing class for the action 
of H on C. Color the vertices of S black and all other vertices of G white. 
Using the methods of the proof of Proposition 4.4 we may assume that 


DEBRA BOUTIN AND WILFRIED IMRICH 116 


v is distinguished by its distances to the other vertices of S. Thus, any 
automorphism @ that preserves S must fix v. Also, a cannot interchange 
C, and C2, because all vertices in Cp are white. Thus, we may consider 
a € H. Because S is a distinguishing class for the action of H on C, a acts 
trivially on C' and thus on Cj. But the only automorphism that moves 
at most a finite number of vertices in C1 is the trivial automorphism. 
Thus, S is a distinguishing class for G. 


Note that Proposition 4.4 can be slightly improved for graphs with 
two ends. One simply applies it twice to the two ends of G and uses the 
fact that the vertex v for which |By(n)| < c is the same in both ends: 


Proposition 4.10. Let v be a vertex of a 2-ended graph G €T such 
that Aut(G),y is a finite group with infinite motion. If G has linear growth 
with growth constant c, then p(G) < max(5, c— 1). 


5. Outlook 


Superficially it looks like we have the bound max{3, c} for one-ended 
graphs, and max{3, 5} for two-ended graphs. But there is a catch. We 
have not been able to find a graph in T’ that has linear growth, only one 
end, and countably infinite automorphism group. 

We show in the following that all graphs in I with linear growth and 
countably infinite automorphism group have at most two ends. Hence, 
the nonexistence of one-ended graphs of linear growth and countably 
infinite automorphism group would mean that we always have the bound 
max{3, 5}. 

For vertex transitive graphs this is indeed the case. It is well known 
that vertex transitive graphs with infinite automorphism group have 1,2 
or infinitely many ends, and that, when there are infinitely many ends, 
their growth is exponential. Hence, in the case of linear growth, we have 
at most two ends. In fact, it is not hard to show that vertex transitive 
graphs of linear growth must have two ends, so for them the bound is 
max{3, 5}. 

However, without vertex transitivity we can only show that graphs 
in I with linear growth have at most two ends. We do this with two 
lemmas. 


Lemma 5.1. Every graph G € T with infinite automorphism group 
has 1, 2, or infinitely many ends. 


Proof. We have to show that a graph G € IT with infinite Aut(G) has 
infinitely many ends if it has more than two ends. If G has more than 
two ends, there is a finite set F such that G—F has at least three infinite 
components, say r, r > 2. 

If G has only finitely many ends there is a finite set S such that G—S 
has the maximum number of infinite components, say s > 2. We also 
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admit S = F. We show that there is an automorphism a@ such that a(F’) 
is contained in an infinite component of G— S and that S is completely 
contained in an infinite component of G — a(F). 

To see this, let F’ be the union of F' with all finite components G — F 
and S$" be the union of S$ with all finite components of G — S. We now 
observe that the orbit of every vertex is infinite by Lemma 4.6. Hence 
there exists an a € Aut(G) such that the distance from a(F’) to S’ 
is larger than the diameter of F'. This means that a(F) is completely 
contained in an infinite component of G — S, say Cyr), and S in an 
infinite component of G —a(F). 

Consider SUa(F) and the components of G—{SUa(F)}. Any infinite 
component of G — S different from Cy) and any infinite component of 
G—a(F) different from Cy) is a component of G—{SUa(F)}, because 
every path between any two of them must go through F,, a(F), or both 
F and a(F). 

Hence, G— {SUa(F)} has at least (s —1)+(r-—1) = s+r—2>s8+4+1 
infinite components, contrary to the maximality of s. 


Lemma 5.2. A graph with linear growth and an infinite orbit cannot 
have more than two ends. 


Proof. Suppose G is a graph of linear growth with growth constant c, 
an infinite orbit, and more than two ends. By Lemma 5.1 it has infinitely 
many ends, and thus a set of c+ 1 nonequivalent rays {Uj1,..., Uc+1}. 
Because they are nonequivalent, each pair of rays is disjoint after a finite 
number of vertices. Thus, there is an integer Mj so that U; and U; 
are disjoint after their Me vertex. Let M = max{Mj,j}; je[e41]- We can 


conclude that Uj,..., Uc41 are pairwise disjoint after their M‘” vertex. 
By disjointness, |Sy(nm)| => c+1 for all n > M. Further, |S,(n)| > 1 for 
n < M. Thus, we have |By(n)| > M+ (c+1)(n — M), which can be 
seen, for n large enough, to violate the fact that the growth constant 
is c. Thus, given G has linear growth, it cannot have more than two 
ends. 


Theorem 5.3. All graphs in T that have linear growth and countably 
infinite automorphism group have one or two ends. 


Proof. By Lemma 4.6 we have an infinite orbit and we can apply 
Lemma 5.2. 


This provides the background for the following problem. 


Problem Does there exist a one-ended graph in T that has linear growth 
and countably infinite automorphism group? 

As mentioned earlier, G cannot be vertex transitive. In fact, G must 
have infinitely many orbits. Furthermore, all vertex stabilizers must be 
finite (see, for example [10, Lemma 2.3]), all orbits must be infinite by 
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Lemma 4.6, and G must have infinite motion by [10, Lemma 4.2]. Finally, 
we add without proof that all automorphisms must have finite order. 
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Added in Proof 


Problem 2 was solved by J. Carmesin, F. Lehner, R.G. Moller, who 
proved that every graph in I’ with linear growth and countable auto- 
morphism group has two ends. This means that the bound of Theorem 
4.9 always holds. The paper will be entitled “Tree decompositions and 
automorphism groups of 1-ended graphs”, and is still in preparation. 
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Abstract 

Let I be a dense countable subgroup of a locally compact continuous 
group G, and w a probability measure on I’. Two spaces of harmonic 
functions are naturally associated with mw: the space of u-harmonic 
functions on the countable group I’ and the space of u-harmonic func- 
tions seen as functions on G defined a.s. with respect to its Haar 
measure A. Correspondingly we have two natural Poisson boundaries: 
the I'-Poisson boundary and the G-Poisson boundary. Since boundaries 
on the countable group are quite well understood, a natural question is 
to ask how the G-boundary is related to the [-boundary. 

In this chapter we introduce a general technique that allows us to 
build the G-Poisson boundary from the I'-boundary. As an application, 
we determine the Poisson boundary of the closure of the Baumslag— 
Solitar group in the group of real matrices. In particular we show that, 
under suitable moment conditions and assuming that the action on R is 
not contracting, this boundary is the p-solenoid. 


Contents 


1. G-Harmonic Functions and G-Poisson Boundary 
2. From T’-Boundaries to G-Boundaries 

3. G-Poisson Boundary of Baumslag—Solitar Group 
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An important topic in the study of random walks on groups is the study 
of harmonic functions relative to a measure jz on a group G, i.e. of the 
functions f on the group such that 


FC) Se [ fondue. (4.1) 


The Poisson boundary is, in this setting, the measurable space that gives 
the integral representation of all bounded harmonic functions. This space 
encodes the asymptotic information contained in all random walk paths 
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of law uw. A natural problem is to determine when this space is trivial 
and, if it is not, to exhibit a geometric model. 

After the works of Blakwell, Choquet and Deny on abelian groups and 
the seminal papers of Furstenberg in the 1960s, much progress has been 
made on these questions. In particular when the harmonic functions live 
on a countable discrete group I’, a complete theory has been developed 
by Derriennic [5], Kaimanovich and Vershik [15], allowing us to construct 
the Poisson boundary (or at least decide whether it is trivial) for large 
classes of groups. 

In the more general case where the measure jz is supported on a locally 
compact group G, the situation is more complex and one has to decide 
on which space harmonic functions live. A natural choice is to consider 
harmonic functions as a subspace of the space L~(G,A) of essentially 
bounded functions with respect to the Haar measure A of the group. If 
the measure jz is spread out (and thus well adapted to the continuous 
structure) satisfactory general results have been obtained for Lie groups. 
The case where the measure p is not necessarily smooth, is far from being 
completely understood. Some results have been obtained for particular 
classes of groups (e.g. Nilpotent groups [10, 2], NA groups [17], etc.). 
Abstract constructions have also been proposed, but they do not allow 
us, in general, to construct geometric models for the boundary, nor to 
check whether it is trivial. I refer to the survey of M. Babillot [1] for 
a precise and complete overview of the subject and a more detailed 
bibliography. 

In antithesis to the case of a smooth measure, we may consider a 
purely atomic measure 44, supported on a countable subgroup I that 
we can suppose is dense in the continuous group G. In such a situation, 
harmonic functions can be seen both as functions on the discrete group 
I and as measurable functions on the continuous group G. 

When the Poisson boundary of the discrete group T is known (so 
that we can describe I'-harmonic functions), several natural questions 
concerning G-measurable harmonic functions arise: 


e Which T-harmonic functions can be extended to a G-harmonic 
function? 

e How are the T-Poisson boundary and the G-Poisson boundary 
related? 

e If we know how G acts on the I'-Poisson boundary, can we give 
conditions that imply there are no nontrivial G-harmonic functions? 


The goal of this manuscript is to investigate these questions. We 
are particularly interested in groups of matrices with rational entries, 
embedded as subgroups of groups of real matrices. In this case the 
Poisson boundaries of the countable subgroups are well understood [4], 
while there are still many open questions concerning the Poisson 
boundaries of the corresponding real groups (see section 1 for more 
detailed examples). 
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In section 2, we give a general construction of the G-Poisson boundary 
as a space of T-ergodic components in the product of G and the I- 
boundary (Proposition 2.3). We use this construction to determine the 
real boundary in the case of the Baumslag Solitar group BS(1,p), 
embedded as a dense subgroup of 


m 
lee | |imezoer} =R wz 


In particular, if w is dilating on R it is known that the BS(1, p)- Poisson 
boundary is the p-adic field Q, (thus there is no “real” component in 
the boundary); however the real Poisson boundary is not trivial and is 
given by the p-solenoid 


[0, 1) x Zp = (R x Qp) /ZQ/p) 


where the action of Z1/p) on R x Q, is the diagonal action 
(Corollary 3.1). 
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1. G-Harmonic Functions and G-Poisson Boundary 


This section is a brief introduction to measurable Poisson boundary, 
following Babillot [1] and Kaimanovich [13]. 


G-Harmonic Functions 


Let G be a locally compact second countable (thus metrizable and 
complete) group. Let 6 be the Borel o-algebra of G and A the right 
Haar measure. 

Let yt be a probability on G such that the closed semigroup generated 
by the support of yu is the whole group G. 

We say that a function f € L™©(G,A) is w-harmonic on (G,A) (or 
G-harmonic) if 


f(g= / figy)du(y) for A-almost all g € G. 
G 


We denote by H?°(G) the subspace of G-harmonic functions in 
DL™(G,A). 

It can be shown, using left convolution by identity approximations 
of G, that any f € H°°(G) is A-a.e. limit of harmonic functions that are 
left uniformly continuous on G. In this sense, the space of G-harmonic 
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functions is determined by the behavior of continuous ones. In particular, 
if all continuous harmonic functions are constant, then H°°(G) is trivial. 
We denote by AR ) the space of left uniformly continuous G-harmonic 
functions. 


Random Walks and Invariant Map 


Harmonic functions can be seen as asymptotic values of random walks 
in the following way. Let (Q,P) = (G,)N be the space of random steps 
and consider the right random walk 


Tn(@) = @1---@n. 


Let f be a bounded G-harmonic function. Notice that because the 
function f is defined only A-almost surely, the process f(grn(@)) is well 
defined only for A-almost all g. For this reason the starting point g has to 
be chosen according to p, a probability law on G with bounded density 
with respect to 4. Then the random process f(grn(@)) is well defined 
on the space (G x Q,o x P), and since f is harmonic, it is a bounded 
martingale. Thus the limit 


lim f(grn(@)) =: Z¢(g,@) exists p(dg)P(dw)-almost surely. (4.2) 
no 
Let T be the shift on Q; then is easily checked that 
Zi (g,@) = Z¢(go1, To) p(dg)P(dw)-almost surely 


that is, Zp is a bounded measurable invariant map on G x Q. In fact 
(4.2) defines an isometry of H?°(G) onto the subspace of measurable 
invariant maps of L°(G x w,p x P). The reverse map is given by 


[z(g) = E(Z(g,0)) p(dg)-almost surely. 


Poisson Transform and G-Poisson boundary 


Take a measurable space (X ,X,v) endowed with a measurable G-action 
and a p-stationary probability measure v. The Poisson transform 


Py 6 fO = [oo mau) 


maps any bounded function ¢@ in L°(X, p * v) to a w-harmonic function 
fg of HEL). 

Notice that the Poisson transform is not well defined as a map on 
L™(X,v). In fact, since v is not in general G-quasi invariant (i.e. g * v 
is not in general absolutely continuous with respect to v), two functions 
that coincide v-a.s. can have different images. 

If the Poisson transform is an isometry of L°(X, p*v) onto H?°(G) 
then we say that (X,v) is the (G, )-Poisson boundary. It can be shown 
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that the Poisson boundary is unique as a G-measurable space and that 
(X, p * v) is a Lebesgue space (cf. [1] propositions 2.26 and 2.28). 

If X is the G-Poisson boundary then there exists a measurable 
boundary map bnd : Q — X such that for every harmonic function 
f ¢ H2°(G) there exists pf ¢ L°(X, p * v) with 


of(g-bnd(w)) = lim f(grn(@)) — p(dg)P(dw) — a.s. 


(cf. [1] proposition 2.26. See also the proof of Lemma 2.1). Thus, 


[y= XG -a)dv(x) ~— p(dg) — as. and 
Zi (9,0) = of(g-bnd(@)) = p(dg)P(dw) — as. 


The p-invariant measure v on X is then the image of P under bnd. The 
boundary map is G-equivariant in the sense that bnd(w) = w;-bnd(T'w). 


Countable Group I 


Suppose now that the group G = T is countable. The Haar measure A 
is then the counting measure and one can choose p with nonzero mass 
in all elements g ¢ [. This means that all the equalities above hold for 
allgeTlr. 

In this particular case (and under the hypothesis that the support of 
ju generates I as a semigroup), the stationary measure v on X is I’-quasi 
invariant and P, is well defined on L°(X, v) itself. 

The fact that is absolutely continuous with respect to Ap is also 
fundamental for the study of Poisson boundary based on entropy [5, 15]. 
This complete theory has permitted us to determine a geometric model 
of the Poisson boundary for large classes of countable groups. 


Countable Subgroup [ of a Continuous G 


In this note we are interested in the case when the measure ju is supported 
on a countable subgroup [ of a continuous group G and in particular 
when I is dense in G. Then a continuous harmonic function f on G is 
uniquely determined by the values f(y) for y €¢ I’. Thus, f can also be 
seen as a T-harmonic function. In other words, the restriction to I is an 
isometric embedding of H~.(G) into H>°(I). In particular, if (X,v) is 
the ['-Poisson boundary then there exists @ in L°(X,v) such that 


r= | o(y-2) yer. 
xX 


However, in general there is no such integral representation for f(g) when 
g is not in I, as X is not a priori a G-space. 

In conclusion, the F-Poisson boundary contains in principle all the 
information about the G-Poisson boundary. But in order to extract this 
information one needs to answer two related questions: 
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e Determine the G-action on (an extension of) X adapted to the 
action of G on H>°(G). 

e Determine which are the functions in L~°(X,v) whose Poisson 
transform can be extended to G. 


Examples: Linear Groups with Rational Coefficients 

We are in particular interested in the case where the '-Poisson boundary 
is known, but G-harmonic functions are not completely understood. Here 
some examples. 


Affine Groups 
The real affine group Aff(R) is the group of real maps (b, a): © ax+b 
with a € R* and } € R; that is the group of matrices 


are ={| 5 | flee Rt ber} =R Ry. 


Harmonic functions on Aff(R) have been widely studied and some 
results are known also without continuity assumptions on the measure |. 
In particular, under the log-moment assumptions: 


‘(| log al) < oo and E(log* b) < 00, 


it is known that: 


e If Edog a) = 0 the Aff(R)-Poisson boundary is trivial ([18], see also 
[1, sect.4.5]). 

e If Edoga) < 0 the Aff(R)-Poisson boundary is R with the p- 
invariant measure v given by the law of 


(ee) 


Loo = Ss stn (4.3) 


1 


where (bn, Gp) are i.i.d. with law pw ({17], see also [1, thm 5.7]). 


If E(dog a) > 0 and the measure is spread out then the Aff(R)-Poisson 
boundary is trivial. But it is still not known what happens if E(log a) > 0 
and the measure jz is supported on a countable subgroup I’. 

On the other hand, using entropic criteria, the [-Poisson boundaries 
are well understood. If © = Aff(Q), the group of affine maps with rational 
coefficients, and under suitable moment conditions, the Aff(Q)-Poisson 
boundary is given by the product of the p-adic fields Q, where the sum 
(4.3) converges a.s., that is 


Qn, 


p:E(log | a|p)<0 


where we use the convention that Qoo = R (see [3}). 
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This property was first proved by V. Kaimanovich [13] in the case of 
the Baumslag-Solitar group 


+1 
BS(1, p) = (| a }) 
=) le m” |imnet qgez|=2(-) xz 


for some prime p. In this particular case, the BS(1, p)-Poisson boundary 
is R if Edog a) = —E(log |a|p) < 0 and Qy if E(log |a|y) = —Edog a) < 0. 

It is then natural to ask which harmonic functions can be extended 
to (continuous) harmonic functions of the closure of BS(1, p) in Aff(R), 
that is to 


p™ b 
Aff(p,R) = {| 0 1 ]imez ber} =Rwz. 

It turns out that, even if the BS(1, p)-Poisson boundary is Qy, the 
real Poisson boundary is not trivial. In Corollary 3.1 we will construct 
the Aff(p,IR)-Poisson boundary as a p-solenoid. 

The unpublished manuscript [16] of J.-F. Quint presents a similar 
example of a dynamic system acting in noncontacting way on the torus 
and constructs harmonic functions on the unstable variety. 

As we will see in Corollary 2.4 this kind of construction is possible as 
the action of BS(1,p) on Aff(p,R) x Q» has a discrete orbit. It is still 
not clear to me what happens when the action of I on the product of G 
and the [’-Poisson boundary is dense. 


Question. For instance, let Aff(1/2,1/3) be the countable subgroup 
generated by the affinities 


31 +1 941 +1 9™2 33 972 3N3 


Suppose E(log|a|o..) > 0; thus the [-Poisson boundary is equal to Qa, 
Q3 or Q2 x Q3 (according to the sign of E(log|a|2) and E(log|a|3) ) and 
has no real component. Is then the Aff(R)-Poisson boundary trivial? 


Semi-Simple Groups 

Similar questions arise for semi-simple groups. Take, for instance, a 
measure 4 supported on SL9(Q). Then the SZ2(Q)-boundary is the 
product of the Q,-projective lines for all primes p such that the support 
of w is not contained in a compact subgroup of SL2(Q,) (see [4]). In 
particular for 


P= SL9(Z(1/2)) 


=the i [lad = ed = 1,0,0,6.e¢ de 2/2" for some me 2} 
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the I'-Poisson boundary is P!(R) x P! (Qa). It is natural to expect that 
the SZ9(R)-Poisson boundary should be P!(R), however I am not aware 
of any proof of this fact. See also [1] section 1.7.4, for a similar example. 


2. From I-Boundaries to G-Boundaries 


Construction of a G-Action on a T-Space 


Let (X, X, v) be a P-measurable Lebesgue space equipped with a measure 
v that is T-quasi invariant. Suppose that I is contained in a locally 
compact group G. We want to construct a sort of minimal class of 
functions on X, on which G acts in such a way that the restriction 
to I of this action coincides with the I-action. 

Consider the product space (G x X,6 x X,p x v) and define the 
T-action on G x X 


y*(g,0) :=(gy',y +2). (4.4) 


Let 3 be the o-algebra of (I, *)-invariant functions of G x X that is 
the class of the functions @ such that p(dg) x v(dxr)-almost surely 


o(g,x)=o(gy yt) VyeTr. (4.5) 


The o-algebra J is complete because p x v is ([, *)-quasi invariant and 
I is countable. Rokhlin’s correspondence associates with the o-algebra 
3 a partition of G x X, such that the functions in 3 are constant on the 
elements of the partition. Because p is in the class of the Haar measure, 
we can choose the partition 7 to be G-equivariant, for the G-action on 
G x X given by left multiplication on the G component. In fact we have 
the following: 


Lemma 2.1. 


1. There exists a countable family {@n}nen of bounded functions dense 
in L'(Gx X,3,p xv) such that for any x € X the function on, 2) 
is continuous on G and such that (4.5) hold for all (g,4)€ Gx X. 
2. Let n be the partition defined by the equivalence relation 


(91, 21) ~ (92, 22) > on(v 91,71) = On(y 92,02) Vn EN andy ef. 
(4.6) 
Then n is a measurable partition (i.e. countably generated) and the 
associated complete o -algebra coincides with 3. In particular, for all 
$ € 3 there exists @ defined on XG x X/n such that ¢ = don 
p X v-a.s. 
3. If G acts on Gx X by the left multiplication on the G component, 
then such a partition is G-equivariant, i.e. 


go -n(g,2) = n(gog, z) Vou,.g€ Gandre X. 
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Proof. 1. Because (G x X,3, x v) is a Lebesgue space, there exists 
a countable family {g;} of bounded functions dense in the L!-norm. Set 
yi(g,x) = 0 on the set of the (g,2) on which (4.5) does not hold, in 
order to obtain a family of functions *-invariant everywhere. 

Take an approximation of the identity on G, ie. a sequence of 
nonnegative continuous functions a7, whose supports shrink to the 
identity e and such that ||an||j = 1. Let 


oP (9,2) = i: on(h)pi(hg, 2)dX(h). 


It is easy to check that the g? are still *-invariant. By classical results, 
for any x € X, the functions g?(-,x) are continuous and converge to 
gi, c) in L'(G, p) when n goes to oo. Then 

bn, Jet — en” = im f letc.m —ec-2f doe 


n> oco N— 0O 
=f jim |wr,a) — 9G, a)]f dove) =0 
oe no 
since lore, L) — iC, Z) 4 is bounded by 2|| 9; er" : 
Thus {@}’}in is a countable family of G-continuous functions dense in 
L(G x X,3,p x v). 
2. Let {J;} be a countable family of intervals of R that separates the 
points and let 


B(n, i) = 671i) © Gx X. 


Then the partition n defined in (4.6) is generated by {y B(n,i)|y € T, 
n,t € N}; in fact 


mag= () yB@a ()\ Bae. 
(g,x)ey B(n,t) (9,2) ¢y B(n,t) 


Given that by step 1 the complete o-algebra generated by the sets B(n, 7) 
is J, by Rohlin’s correspondence, we can conclude that any function of 
3 is almost surely constant on the elements of the partition. 

3. Finally to prove G-equivariance of n, we need to verify that for 
every g0,g € G and x € X, we have that (9', 2’) € n(g, x), ie. 


dnivg’, 2’) =onlyg,c) YWneNandy eT 
if and only if go - (g’, x’) = (gog’, x’) € n(gog, 2), i.e. 
bn(vog0g9', t') = bn(ygog,4) Vn e Nand yer. 


This follows from the fact that the functions ¢, are G-continuous and 
that T is dense in G, letting y > yogo (resp. yo > V9"): 
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If 7 is defined as in (4.6), let 
X=GxX/n 
be the space of the 7 components. The projection n: G x X > X defines 
a natural o-algebra on X 


F={AcGx XA) eBx 3]. 


By the previous lemma the completion of n (3) is J. 
We have just proved that X has a natural structure of a G-space: 
90 -n(g,x) = n(gog, 2). (4.7) 
Because the functions ¢, are *-invariant everywhere, 7 allows us to 
“transfer” the P-action from G to X, in the sense that 
ngy, c)=ng,y-%) Vg,tEeGxX. 


In particular, the action of T on X and on its projection on X are 
related by 


y-n(e,£) = nly, t) =n(e,y +2). 
Let DV = n,(5¢ X v) be the image on X of the measure be X v by n; 
that is, 
5G) = | Gale.eyav(a. (4.8) 


By (4.7) we have then that the image by 7 of p x v is 0 *V: 


nex Doe / 3(n(g, 2))dp(g)dv(z) 


GxxX 


= / o(g -n(e, x))dp(g)dv(2) 
GxX | 
= p*v(d). 


In conclusion, 


Corollary 2.2. The_projection n induces an isometry between L(G x 
X,3,p x v) and L*(X,5,p *V). 


In the next section, using this measure theoretical construction, we 
will build the G-boundary on the T'-boundary and prove that, if the 
x-action has a fundamental domain, this fundamental domain is the 
G-boundary. However, it is not clear to me how to construct a geometric 
model of this measure space when the I’-action is “dense.” 

An interesting case is, for instance, when G acts on X and this action 
coincides with the I’-action. Then L°(X, p * v) embeds isometrically in 
L@(G x X,3,o x v). In fact, if we [D°(X,p * v), then 


oy (9,2) = W(g- 2) 
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is clearly *-invariant and this embedding is an isometry since 
1/p 
lo = im, ( f (0.2) o(dayv«ae)) 


1/p 
= im (/ wi? = vay) = [Vloo. 


Question. However, it is not clear under which conditions this map is 
surjective, that is when X coincide with X. 

For instance, as a toy model, take G = (R,+), X = Rand Il =Q. For 
which measure v does X = R? This is true by if v is a.c. with respect to 
the Lebesgue measure, but what happens for other measures? 

What happens if G = SLo(R) X = P(R) and T = SI9(Z(1/2)) 
(or T = SL2(Q))? 


From T-Boundaries to G-Boundaries 


Suppose that the measure v on X is y-stationary. For every bounded 
function @ in L*(G x X,p x v), define the Poisson transform: 


Pv: GP fom = [ 60g. 20av¢eten A(dg)-almost all g. 


If ¢ € 3 then fy is a bounded p-harmonic function on L°(G,A), as 
required. Indeed 


f(g) = f o.2va0) = f og.7-2yv(dryucdy) = 


= [ oy.2w(asyntay) = f oorucay). 


The following proposition shows that all G-harmonic functions can be 
written in such a way. 


Proposition 2.3. If (X,v) is the Poisson boundary of (Tw) then for 
every L-harmonic function f in G, there exists a bounded function @ € 3 
such that f = fg in L°(G,A). 

In this case Py is an isometry from L°(G x X,3,p xv) onto AY (G). 
In other words, (X,¥V) is the G-Poisson boundary. 


Proof. Let w € (Q,P) = (TN, w®%) and rp, = ry(@) = @1---@p, be the 
right random walk on I of law yw. The process f(grz(@)) is a bounded 
martingale on the space (G x Q, o x P); thus it converges almost surely. 
If bnd Q > X is the boundary map 


Tim, f(gr(@)) = 6(g, bnd(@)) (4.9) 


p x P-almost surely. Thus ¢(g, bnd(w)) is G x Q measurable and, since 
v= bnd~!P, the function ¢(g,z) is G x X-measurable. Furthermore, 


A CONSTRUCTION OF THE MEASURABLE POISSON BOUNDARY 131 
given that T is countable, for p x P-almost all (g,@) 
dim FY Tk) = d(gy,bnd(w)) forally Ef. 
Since X is a u-boundary, notice that 
@ bnd(T'w) = bnd(a) 


where T is the shift on Q. Take y, in the support of w, then the event 
Y1 = @, has a positive measure and is conditioned to this event 


(97, ',yibnd(To)) = $(gy; ', bnd(o)) 
= lim f(y 'yitn(To)) = o(g, bnd(T)). 


Because Tw is independent of w; and of same law as w and that the 
support of « generates I, we can conclude that ¢ € J. 
Lastly, let us check that the Poisson transform is an isometry. In fact. 


1/p 
Wl = tim (f uooraoa) 


1/p a 
< tim (f fou.0P ado) = 012”. 


On the other hand by the bounded convergence theorem 


l/p 
ioig” =(f [1a nPavac) 

1/p 
(f [i tim, torso? aCordpca) 


1/p 
Jim, ([ [ vernon ap arco) 


1/p 
< ( / (W/IPaPo) ) = If 


since p is G-quasi invariant. 


A 


G-Poisson Boundary as I-Ergodic Diagonal Components 


Another way to express the result of Proposition 2.3 is to say that the 
G-Poisson boundary coincides with the space of ergodic components of 
I on (G x X) with respect to the action * defined in (4.4). 

Observe that the action * is, in reality, the standard left diagonal 
action of f on G x X: 


In fact the two actions are conjugated by the map m : (g,2)  (g7!, 2), 


that is an isomorphism of the measure space of (G x X,p x v) that 


SARA BROFFERIO 132 


preserves the class of measure. Thus the space L°(G x X,3,o x v) 
coincides (via 2) with the space of bounded functions of (G x X,p x v) 
that project on T\(G x X). In particular, the G-Poisson boundary is 
trivial if and only if the (diagonal) action of T on (G x X,p x v) is 
ergodic. 

Conversely if the action of [ on G x X is “measurably discrete,” that 
is, there exists a fundamental domain A, then it is possible to identify 
the G-Poisson boundary with this geometric model: 


Corollary 2.4. Suppose there exists a measurable fundamental domain 
A € 6x for the action x of f on Gx X (or equivalently for the diagonal 
action) that is 

epxvTx«A)=1 

e px v(An Were yxwA)=0. 
Let D be the restriction of the o-algebra 6x X to A. Then L®(A,D, pxv) 


is isometric to L°(G x X,3,0 x v). The measurable space (A,D) with 
the induced G-action 


90 * $(9,) = >) b(gogy |. y- t)1a(gogy |. + 2) 
yer 


for all ¢ € L*(A,D, p x v) and the p-invariant measure defined by 


Vd) = yf oorhy -r)la(y~*,y + 2)v(de) 


yer 
is the G-Poisson boundary. 


Proof. The map 
APTP«A 


induces an isometry of L°(A,®,p x v) onto L(G x X,35,p x v). 

In fact, if A is a nontrivial set of A then x A is a non trivial set of 3. 
Clearly [* A € 3 and it has nonzero measure. Let B C A be a nontrivial 
set such that p x v(AN B) = 0. We claim that p x vp *« ANT « B) = 0; 
in fact, the measure p x v being quasi-invariant p x v(y* ANy * B) =0 
and if y1 # y2 


pxvyixANye*B)<pxvi[yxl{An LU vxA]] =o. 
yeP—{e} 


The isometry is surjective. Let I € 3, then we claim that J = [*(UNA). 
Indeed 


Pen A= LvxtnysAa|JUnyxd) HINT a. 
Y Wa 


A CONSTRUCTION OF THE MEASURABLE POISSON BOUNDARY 133 


Observe that if A C A then 


Irea(g,t) = > la(gy y+ 2) 
yer 


and the sum has only one term for p x v-almost all (g, x) . It can easily 
be seen that the projection of v on D is 


HA) = Of taQ 7} y- a)v(de) = v0 # A). 


yeD 


3. G-Poisson Boundary of Baumslag—Solitar Group 


Corollary 3.1. Let p be a prime number and consider the Baumslag- 


Solitar group 
+1 ey 


Let ww be an irreducible measure on BS(1,p) with first logarithmic 
moment on R and Qy. Suppose that 


ie [ lbglaG)ipauty)-<0 


where y = ae | that is the BS(1,p)-Poisson boundary is 
X = Qp. Let 


m 
ago.R)={[% | |imezseR}=Rwz 


be the closure of BS(1, p) in Aff(R). Then the Aff(p,R)-Poisson bound- 
ary is the p-solenoid: 


A = {(g, 2) € Aff(R) x Qpla(g) = 1,0 < b(g) < 1; |zlp < 1} = [0,1) xZp, 
equipped with the Aff(p,R)-action on @ € L(A, p x v): 


(b,p™) (00,0) = D> 1a -b(p™2o0+b— B, pty + B), 
BeZ(1/p) 
and the invariant measure 


v@= YL [ 68.2 - prix, p(erv(aey. 


BeZ(1/pyn[0,1) 
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Proof. We just need to prove that A is a fundamental domain. In fact, 
for any x € Qp, let a(x) € Z(1/p) such that |x — a(x)|p < 1. The choice 
of @ is unique up to the sum with an integer. It is easily checked that, 
for every (6,2) € R x Qp, the unique k € Z(1/p) such |x + k|y < 1 and 
b—k e€ (0,1) isk =[b+a(2)] — a(x). Thus 


yx((b,p™), ct) € AS y= ((b+a(p™z)| —a(p™ x), p™). 
Oo 


To illustrate how the previous corollary can be used to study the 
behavior of harmonic functions on BS(1, p), consider, for example, 


(9, £) = 1fo,1)x(1y(M 1 pz, () 


and the associated harmonic function: 


f(b, p™) = / Y>  ljo(b— B)1pz,(p™ 2 + B)v(de). 


BeZ(1/p) 


Then we have 


e f is periodic of period p on the b coordinate 


f(pk+b,p™) =f Y> loay(b—B)1pz, (px + B + pk)v(dz) 
BeZ(1/p) 
= f(b, p™) 
e limms+oof(b,p™) = 1 if b € [0,1) + pZ. In fact |[f lo = 1 and 
b € [0,1) 


f(b,p™) > / 1jo,1) @)1pz)(P™2)v(dr) = v(p!-™Zp) + 1 


when m— +00 
e limm—++oo f(b, p™) = 0 if b Z [0,1) + pZ in fact 


F(b,p™) = » 1io,1) (0 ar B)1pz, (ole + B)1pi-mz, (x)v(dx) + 
BeZ(1/p) 
Pv 25) 
< > lpoay(b—B)1pz, (6) + A — v(p!"™Zp)) 
BeZ(1/p) 
= ae 1jo,1) (0 — pk) + A — v(pi-™ Zn) 
keZ 
= 1fo,t+pz(b) + A — v(p!-™Zp)). 
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Abstract 

A self-contained account of the theory of structure trees for edge cuts in 
networks is given. Applications include a generalisation of the Max-Flow 
Min-Cut theorem to infinite networks and a short proof of a conjecture 
of Kropholler. This gives a relative version of Stallings theorem on the 
structure of groups with more than one end. A generalisation of the 
Almost Stability theorem is also obtained, which provides information 
about the structure of the Sageev cubing. 
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1. Introduction 


Let X be a connected graph. A subset A of the vertex set VX is defined 
to be a cut if 5A is finite. Here 5A is the set of edges with one vertex in 
A and one vertex in A* = VX — A. A ray R in X is an infinite sequence 
21, %2,... of distinct vertices such that x, 241 are adjacent for every 7. 
If A is an edge cut and R is a ray, then there exists an integer N such 
that for n > N either z, € A or rz, € A*. We say that A separates rays 
R = (ap), R' = (a;,) if for n large enough either zp, € A,zj, € A* or 
tn € A*, a), € A. We define R ~ R’ if they are not separated by any edge 
cut. It is easy to show that ~ is an equivalence relation on the set ®X of 
rays in X. The set QX = ®X/ ~ is the set of edge ends of X. An edge 
cut A separates ends w, a’ if it separates rays representing w, a’. A cut 
A separates an end w and a vertex v € VX if for any ray representing 
w, R is eventually in A and v € A®* or vice versa. 
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The number e(G) of a finitely generated group is the number of ends 
of a Cayley graph of X with respect to a finite generating set S. It turns 
out that e(G) does not depend on which generating set S is chosen, 
and that it is always one of 0, 1,2 or the cardinal number c. If a finitely 
generated group G has more than one end, then there is a cut A C G 
(the vertex set of any Cayley graph), which separates two rays. Thus 
both A and A* are infinite. The fact that 6A is finite is equivalent to the 
fact that the symmetric difference A+ As is finite for each s € S, and 
it is not hard to see that this is equivalent to requiring that A+ Ag is 
finite for every g € G. A set A with these properties is called a proper 
almost invariant set. Thus a subset A of G is said to be almost invariant 
if the symmetric difference A+ Ag is finite for every g € G. In addition 
A is said to be proper if both A and A* = G — A are infinite. Clearly 
the finitely generated group G has more than one end if and only if it 
has a proper almost invariant subset. This provides a way of extending 
our definition to arbitrary groups. We say that a group G has more than 
one end if it has a proper almost invariant set. 


Theorem 1.1. A group G contains a proper almost invariant subset 
(i.e. it has more than one end) if and only if it has a non-trivial action 
on a tree with finite edge stabilizers. 


This result was proved by Stallings [31] for finitely generated groups 
and was generalized to all groups by Dicks and Dunwoody [3]. The action 
of a group G on a tree is trivial if there is a vertex that is fixed by all 
of G. Every group has a trivial action on a tree. 

Let T be a tree with directed edge set ET. If e is a directed edge, then 
let € denote e with the reverse orientation. If e,f are distinct directed 
edges, then write e > f if the smallest subtree of T containing e and f 
is as below. 


e f 


RIS Pe 


Suppose the group G acts on T’.. We say that g shifts e if either e > ge 
or ge > e. If for some e € ET and some g € G, g shifts e, then G acts 
non-trivially on a tree JT’. obtained by contracting all edges of T not in 
the orbit of e or e. In this action there is just one orbit of edge pairs. 
Bass-Serre theory tells us that either G= Gy*g, Gy where u, v are the 
vertices of e and they are in different orbits in the contracted tree Te, 
or G is the HNN-group G = Gyx*q, if u,v are in the same G-orbit. If 
either case occurs we say that G splits over Ge. 

If there is no edge e that is shifted by any g € G, (and G acts without 
involutions, i.e. there is no g € G such that ge = @), then G must fix a 
vertex or an end of T. If the action is non-trivial, it fixes an end of T, i.e. 
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G is a union of an ascending sequence of vertex stabilizers, G = J Gy, 
where vj, v2,... is a sequence of adjacent vertices and Gy, < Gy <... 
and G # G,,, for any n. 

Thus Theorem 1.1 could be restated as 


Theorem 1.2 ((31], [3]).. A group G contains a proper almost invariant 
subset (i.e. it has more than one end) if and only if it splits over a finite 
subgroup or it is countably infinite and locally finite. 


If a group splits over a finite subgroup, then it is possible to choose 
a generating set S so that the Cayley graph has more than one end. 
However, for a countably infinite locally finite group there is no Cayley 
graph with more than one end. 

The if part of the theorem is fairly easy to prove. We now prove a 
stronger version of the if part, following [2]. 

Let H be a subgroup of G. A subset A is H-finite if A is contained 
in finitely many right H-cosets, i.e. for some finite set F, A C HF. A 
subgroup K is H-finite if and only if HM K has finite index in Kk. Let 
T be a G-tree and suppose there is an edge e and vertex v. 

We say that e points at v if there is a subtree of T as below. We write 
e>v. 


€ U 


I ee 


Let Gle, v] ={g € Gle > gu}. 

If h € G, then Gle, vJh = Gle, hv], since if e > gu,e > gh(h—!v). 

It follows from this that If K = G,, then Ge, v]|K = Gl[e, v]. Also if 
H = Ge, then HGl[e, v] = G[e, v]. 

Ifv =ce, then Ge = H < K = Gy, andif A= Gle,ve|, then A = HAK. 


U € 
o—__>_ 


Consider the set Az,x ¢ G. If g €¢ A,gxu ¢ A, then e > gu,é > gav. 
This means that e is on the directed path joining grv and gv. This 
happens if and only if g te is on the path joining sv and v. There 
are only finitely many directed edges in the G-orbit of e in this path. 
Hence g~! € FH, where F is finite, and H = Ge, and g € HF~!. Thus 
A-—Ar~! = HF-!, i.e. A— Ax! is H-finite. It follows that both Ax — A 
and A — Az are H-finite and so A+ Az is H-finite for every x € G, ice. 
A is an H-almost invariant set. 

If the action on T is non-trivial, then neither A nor A* is H-finite. We 
say that A is proper. 

Peter Kropholler has conjectured that the following generalization of 
Theorem 1.1 is true for finitely generated groups. 
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Conjecture 1.3. Let G be a group and let H be a subgroup. If there 
is a proper H-almost invariant subset A such that A= AH, then G has 
a non-trivial action on a tree in which H fixes a vertex v and every edge 
incident with v has an H-finite stabilizer. 


We have seen that the conjecture is true if H has one element. The 
conjecture has been proved for H and G satisfying extra conditions 
by Kropholler [22], Dunwoody and Roller [15], Niblo [26] and Kar and 
Niblo [20]. 

If G is the triangle group G = (a, bla? = b® = (ab)’ = 1), then G 
has an infinite cyclic subgroup H for which there is a proper H-almost 
invariant set. Note that in this case G has no non-trivial action on a 
tree, so the condition A = AH is necessary in Conjecture 1.3. 

A discussion of the Kropholler conjecture is given in [28]. I first learned 
of this conjecture in a letter Peter wrote to me in January 1988, a page 
of which is shown here. 

We give a proof of the conjecture when G is finitely generated over 
H, i.e. it is generated by H together with a finite subset. 
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Iam very grateful to Peter Kropholler for enjoyable discussions and a 
very helpful email correpondence about his conjecture. 

The main tool in proving Conjecture 1.3 is the theory of structure trees 
in connected graphs, which was initiated in [8] and [3]. In the next section 
a fairly self-contained account of this theory is given. In fact the theory 
is extended to apply to networks, and it is shown that the sequence of 
structure trees obtained for a network is uniquely determined. It is this 
property that is crucial in proving the Kropholler conjecture. 

Another very interesting aspect of extending the theory to networks 
is that one can obtain non-trivial result for a finite network. Results 
for finite networks such as the Max-Flow Min-Cut Theorem (MFMC) 
and the existence of a Gomory-Hu tree are shown to be special cases of 
our results for more general networks. It is also the case that Stallings 
theorem on the structure of groups with more than one end also follows 
from the theory developed here. It is very pleasing (to me at least) 
that there is a theory that includes both the Stallings theorem and the 
MFMC. 

If A is an almost invariant set, and M = {B|B =, A} so that for 
B,C € M,B+C = F where F is finite, then the Almost Stability 
theorem of [3] shows that M is the vertex set of a G-tree T. 

We can define a metric on M. For B, C € M define d(B, C) = |B+C|, 
and this is a geodesic metric on T. In the final section a generalisation 
of this result is proved. If H is a subgroup of G, and A = HAH is an 
H-almost invariant subset, we now put M = {B|A+ B= HF} where F is 
finite, then M is aright G-set, i.e. it admits an action of G by right mul- 
tiplication. If for B, C € M we put d(B, C) to be the number of cosets 
Ax in B+C, then we have a metric on M as before. It is again shown that 
this is a metric on a tree, thus giving the H-almost stability theorem. 

At the risk of appearing self-indulgent, I record the history of the 
theory of structure trees. 

As noted above, in his breakthrough work [31] on groups with more 
than one end, in the late 1960s, Stallings showed that a finitely generated 
group has a Cayley graph (corresponding to a finite generating set) with 
more than one end if and only if it has a certain structure. At about that 
time Bass and Serre (see [3] or [30]) developed their theory of groups 
acting on trees and it was clear that the structure of a group with more 
than one end, as in Stallings theorem, was associated with an action 
on a tree. In [7] I gave a proof of Stallings’ result by constructing a 
tree on which the relevant group acted. This involved showing that if 
the finitely generated group G had more than one end, then there is a 
subset B C G such that both B and B* are infinite, 6B is finite and the 
set E€ = {gB\g € G} is almost nested. Here we define a cut in a graph 
X to be a subset A of VX such that 6A if finite, where 6A is the set of 
edges with one vertex in A and one vertex in A* = VX — A. The set of 
all cuts is denoted BX. 
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A set € of cuts is almost nested if for every A,B € E at least one 
corner of A and B is finite. A corner of A,B is one of the four sets 
AN B, A* 0 B,AN B*, A*N B*. 

In [8] I gave a stronger result by showing that if a group G acts on 
a graph X with more than one end, then there exists a subset B €¢ BX 
such that B and B* are both infinite and for any g € G the sets B 
and gB are nested, i.e. at least one of the four corners is empty. The 
set of all such gB can be shown to be the edge set of a tree, called a 
structure tree. 

This result was further extended by Warren Dicks and myself [3]. In 
Chapter 2 of that book it is shown that for any graph X the Boolean 
ring BX has a particular nested set of generators invariant under the 
automorphism group of G. At the time I thought that the result when 
applied to finite graphs was of little interest. This was partly because 
an action of a group on a finite tree is always trivial, i.e. there is always 
a vertex of the tree fixed by the whole group. This is not the case for 
groups acting on infinite trees: the theory of such actions is the subject 
matter of Bass-Serre theory. Also for a finite graph X, there is always 
a nested set of generators for BX consisting of single elements subsets. 
The belated realisation that the theory developed in [3] might be of some 
significance for finite networks occurred only recently. 

In 2007 Bernhard Kron asked me if one could develop a theory of 
structure trees for graphs with more than one vertex end rather than 
more than one edge end. These are connected graphs that have more 
than one infinite component after removing finitely many vertices. We 
were able to develop such a theory in [14]. In the course of our work on 
this, we realised that we could develop a theory of structure trees for 
finite graphs that generalised the theory of Tutte [33], who obtained a 
structure tree result for two-connected finite graphs that are not three- 
connected. The theory for vertex cuts is more complicated than that for 
edge cuts. In 2008 I learned about the cactus theorem for min-cuts from 
Panos Papasoglu. This theory, due to Dinits, Karsanov and Lomonosov 
[5] (see also [16]) is for finite networks. It is possible, with a bit more 
work, to deduce the cactus theorem from the proof of Theorem 2.2. 
Evangelidou and Papasoglu [19] have obtained a cactus theorem for edge 
cuts in infinite graphs, giving a new proof of Stallings theorem. In [4] 
Diekert and Weiss gave a definition for thin cuts, which is equivalent 
to the one given in [3], but which made more apparent the connection 
with the Max-Flow Min-Cut theorem. I also had a very helpful email 
exchange with Armin Weiss. Weiss told me about Gomory-Hu trees that 
are structure trees in finite networks. 

Thinking about these matters finally led me to think about structure 
trees for edge cuts in finite graphs and networks and the realisation that 
the theory developed in [3] might be of some interest when applied to 
finite networks. 
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In Section 2 the theory for finite networks is recalled. The theory is 
then generalised to arbitrary networks. For any network N we obtain 
a canonically determined sequence of trees T, that provide complete 
information about the separation of a pair s,¢t where each of s and t is 
either a vertex or an end of X. It is possible to obtain all such information 
from a single tree T, if X is accessible. A graph is accessible if there 
is an integer n such that any two ends can be separated by removing 
at most n edges. This definition is due to Thomassen and Woess [32]. 
Other ways of defining accessibility of graphs are discussed. There are 
locally finite vertex transitive graphs that are inaccessible. Such graphs 
are constructed in [9] or [10]. 

The situation for edge cuts contrasts with the situation for vertex cuts. 
Thus there is a canonically determined sequence of trees that separates 
a pair s,t in the set of vertices or ends of the graph X. For vertex cuts, 
one can only find a canonically defined structure tree that separates a 
pair x-inseparable sets or a pair of vertex ends, where x is the smallest 
integer for which it is possible to separate such a pair. 

Structure tree theory has been used by several authors to classify 
infinite graphs that have more than one end and which satisfy different 
transitivity condition. For example, Macpherson [24] used a structure 
tree to classify infinite locally finite distance transitive graphs, and Moller 
[25] used these methods to classify infinite ended locally finite graphs for 
which the automorphism group acted transitively on the ends. In [32] 
Thomassen and Woess obtain a number of results using structure trees. 
They show, for example, that if r is prime, then a connected, r-regular, 
1-transitive graph with more than one end, is a tree. 

This chapter incorporates two papers [12] and [13] that have appeared 
on arXiv. I am very grateful to Peter Kropholler and Armando Martino 
who made a careful study of the earlier papers. I have included their 
suggestions and corrections in this version. I thank Alex Margolis for his 
careful comments and suggestions. 


2. Networks and Structure Trees 


2.1. Finite Networks 


In this subsection we define our terminology, but restrict attention 
to networks based on finite graphs. We recall the Max-Flow Min-Cut 
theorem and state the result that our more general theory gives for finite 
networks. We illustrate the theory with examples. 

We define a network N to be a finite simple, connected graph X and 
amapc: EX => {1,2,...}. 

Let s,t € VX. An (s,t)-flow in N isa map f: EX — {0,1,2,...} 
together with an assignment of a direction to each edge e so that its 
vertices are ve and te and the following holds. 
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(i) For each e € EX, f(e) < c(e). 

(ii) If we put fT(v) = Uf (eee = v) and f7(v) = U(f(e)|te = v), 
then for every v € VX,vu 4 s,v &t, we have ft(v) = f~(v). That 
is, at every vertex except s or t, the flow into that vertex is the 
same as the flow out. 


It is easy to show that in an (s, t)-flow, f(s) — f7(s) = -—(fT() — 
f—~@)). The value of the flow is defined to be |f| = |ft(s) — f7(s)|. We 
define a cut in X to be asubset A of VX, AAV,A A VX. If A isa cut, 
then so is its complement A*. If N is a network and A Cc VX is a cut, 
then the capacity c(A) of A is the sum c(A) = X{c(e)le = (u,v), u € 
A.v € A*}. We define 6A to be the set of edges with one vertex in A and 
one in A*, so that c(A) is the sum of the values c(e) as e ranges over the 
edges of 6A. We could replace each edge e of X with c(e) edges joining 
the same two vertices and then have a theory in which the capacity of a 
cut is the number of edges in 6A. 

In Figure 5.1 a network is shown, together with a max-flow (which has 
value 7), together with a corresponding min-cut. 


Theorem 2.1 (The Max-Flow Min-Cut theorem [17]).. The maximum 
value of an (s,t)-flow is the minimal capacity of a cut separating s 
and t. 


In the proof of this result it is shown that one obtains a min-cut from 
a max-flow as the set of vertices that are connected to s by a path in 
which each edge has some unused capacity. Thus in Figure 5.1, bottom 
diagram, the min-cut vertices are the ones to the left of the dotted line 
and are shown in bold. 

In this chapter it is shown that for any finite network there is a 
uniquely determined network based on a structure tree that provides a 
convenient way of encoding the minimal flow between any pair of vertices. 
Specifically we will prove the following theorem. 


Theorem 2.2. Let N(X) be a finite network. There is a uniquely 
determined network N(T) based on a tree T and an injective map 
v: VX > VT, such that the maximum value of an (s,t)-flow in X 
is the maximum value of a (vs,vt)-flow in N(T). Also, for any edge 
e’ € ET, there are vertices s,t € VX such that e’ is on the geodesic 
joining vs and vt and c(e’) is the capacity of a minimal (s, t)-cut. 


An example of a network and its structure tree are shown in Figure 5.2. 
Thus in this network the max-flow between u and p is 12. One can read 
off a corresponding min-cut by removing the corresponding edge from the 
structure tree. Thus a min-cut separating u and p is {q,r,s,t, u,v, w}. 
The map v need not be surjective. In our example there is a single vertex 
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Figure 5.1 Max-Flow Min-Cut theorem 


z that is not in the image of v shown in bold. One can get a structure 
tree for which v is bijective by contracting one of the four edges incident 
with this vertex. The tree then obtained is a Gomory-Hu tree [18]. 
The structure tree constructed in the proof of Theorem 2.2 is uniquely 
determined and is therefore invariant under the automorphism group of 
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Figure 5.2 Network and structure tree 
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the network. The tree obtained by contracting one of the four edges is 
no longer uniquely determined as one gets a different tree for each of the 
four choices. In some cases this would mean that the structure tree did 
not admit the automorphism group of the network. Thus, for example, if 
the automorphism group of X is transitive on VX and c(e) = 1 for every 
edge, then the structure tree would have n vertices of degree one, where 
n = |VX| and one vertex of degree n. Clearly this structure tree will 
admit the automorphism group of X, but if one edge is contracted to get 
a tree with n vertices, then the new tree will not admit the automorphism 
group. 

Not every min-cut separating a pair of vertices can be obtained 
from the structure tree. The min-cuts obtained are the ones that are 
optimally nested with the cuts of equal or smaller capacity. In our 
example there are four cuts of capacity 12 corresponding to edges in 
the structure tree incident with z. However, there are other cuts of 
capacity 12. Thus there are two min-cuts in the structure tree separating 
k and h, but there are in fact four min-cuts separating k and h. In [5] 
it is shown that the min-cuts separating two vertices correspond to the 
edge cuts in a cactus, which is a connected graph in which each edge 
belongs to at most one cycle. The cactus of min-cuts separating k and h 
is a 4-cycle. 


2.2. The Algebra of Cuts 


Let N be a network based on the graph X. We now allow X to be infinite. 
Thus N is a simple connected graph with a map c: EX — {1,2,...}. 
If A is a cut, i.e. a subset of VX for which 6A is finite, then let c(A) = 
X{c(e)|e € 5A}. Note that we do not assume that X is locally finite. It 
is convenient from here on to allow @ and VX to be cuts. Thus the set 
of cuts is a Boolean ring BX. 

A ray Rin X is an infinite sequence 21, 22,... of distinct vertices such 
that 2;,%j41 are adjacent for every i. If A is an edge cut, and R is a 
ray, then there exists an integer N such that for n > N either tr, € A 
Or Ln € A*. We say that A separates rays R = (a), R’ = (ay) if for 
n large enough either z, € A, € A* or a € A*, az), € A. We define 
R ~ R’ if they are not separated by any edge cut. It is easy to show 
that ~ is an equivalence relation on the set ®X of rays in X. The set 
QX = ®X/ ~ is the set of edge ends of X. An edge cut A separates 
ends w,a’ if it separates rays representing w,qw’. A cut A separates an 
end w and a vertex v € VX if for any ray representing w, R is eventually 
in A and v € A* or vice versa. 

A cut A is defined to be thin with respect to u,v € VX UQX if it 
separates some u,v € VX UQX and c(A) is minimal among all the cuts 
that separate u and v. A cut is defined to be thin if it is thin with respect 
to u,v for some u,v € VX UQX. 
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A cut A is defined to be tight if both A and A®* are connected, i.e if 
x,y € A then there is a path joining x, y whose vertices are all in A, and 
similarly for A*. 


Proposition 2.3. A thin cut is tight. 


Proof. Let A be thin with respect to u,v. It is easy to see that if 
A separates u,v then some component C of A or A* must separate 
u,v. If C is a component of A, then 6C Cc 6A and if C,D are distinct 
components then 6C' and 6D are disjoint. Thus if A is thin then C = A. 
The result follows. 


It is shown in [8] that there are only finitely many tight cuts C' with a 
fixed capacity such that 6C contains a particular edge. The proof of this 
in [32] is neater and it is reproduced here for completeness. By replacing 
each edge with capacity c(e), by c(e) edges joining the same pair of 
vertices, we can assume that every edge has capacity one. 


Proposition 2.4. For any e € EX, there are only finitely many tight 
cuts A with |SA| = c(A) =k such that e € 6A. 


Proof. The proof is by induction on k. For & = 1 there is nothing to 
prove. So assume k > 1. We can assume that e = zy is in some tight 
k-cut, i.e. a cut A such that 5A has k edges. Hence X — e has a path 
P from z to y. Now every tight k cut that contains e also contains an 
edge of P. By the induction hypothesis there are only finitely many tight 
(K — 1)-cuts in X — e containing an edge of P, and we are done. 


If A, B are cuts, then the sets AN B, A* 1 B, A*N B, A*N B, AN B* 
are also cuts. These sets are called the corners of A,B. This term is 
suggested by Figure 5.3. Two corners are called opposite or adjacent as 
suggested in this figure. We say two cuts A, B are nested if one AN B, 
A* 0 B, A* 0B, A* 1B, AN B* is empty. A set E of cuts is said to be 
nested if any two elements of € are nested. 

Two cuts that are not nested are said to cross. 


Lemma 2.5. Let B be a cut. There are only finitely many tight cuts 
A with capacity n that cross B. 


Proof. Let F be a finite connected subgraph of X that contains 6B. 
If A crosses B, then both A and A* contain a vertex of 6B. Hence F’ 
contains an edge of 6A. The lemma follows from Proposition 2.4. 


We consider sets € satisfying the following conditions: 


(i) If Ae €, then A* € E. 
(ii) The set € is nested. 
(iii) If A,B € € and A C B, then there are only finitely many C € € 
such that AC CC B. 


The following result was first obtained explicitly in [7]. 
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Theorem 2.6. Jf € is a set satisfying conditions (i), (ti) and (iii), 
then there is a tree T = T(E) such that the directed edge set is E. 


Proof. Consider the set of maps a: € > Zg satisfying the following: 


(a) If a@(A) = 1, then a(A*) = 0. If a(A) = 0, then a(A*) = 1. 
(b) If (A) =1 and Ac B, then a(B) = 1. 


The vertex set of T will be a subset of the set of all maps satisfying 
(a) and (b). 

Put ET = € and for A e€ €, put :A =a where a(B) =1if ACB 
or if A* Cc B. Put tA = 1A* Thus tA = B where 6(B) = 1 if A* C B 
or if A C B. Then tA and TA take the same value on every B except if 
B= Aor B= A*. We define VT to be the set of all functions which are 
tA for some A € E. It is fairly easy to check that 1A satisfies conditions 
(a) and (b). If u = 7A and v = TB, then the directed edges in a path 
joining u and v consist of the set {C € E|A C C C B}. Using (iii) this 
set is totally ordered by inclusion and if it is finite by (ii) so is the unique 
geodesic joining u and v. Thus T is a tree. 


If € is a nested set of cuts, satisfying (i) (ii) and (iii), in a graph X and 
there is a bound on the capacity of cuts in €, then the set of all maps 
a:€ — Zz satisfying (a) and (b) can be identified with VT UQT. Thus 
a ray in T corresponds to a strictly £} > Fo D ... of cuts in € and if 
we put a(F) = 1 if for 7 sufficiently large FE D> £;. It follows from the 
finite interval condition (iii) that the intersection of all the E;’s is the 
empty set and that for any cut A for 7 sufficiently large either £; Cc A 
or E£; C A*. Ifa: € > Zz satisfies (a) and (b) and there is a unique 
minimal B € € for which a(B) = 1, then a = vB. If there is no such B 
then we can find a strictly EF, D EoD... of cuts in € such that af; = 1 
for every 7. Thus @ corresponds to a ray in T. 

We can identify a vertex v of X with a map v: BX — Zp». Thus 
v(A) = 1 if v € A and v(A) = Oif v € A. Restricting to € will give a 
vertex of T. Thus there is a map v : VX — VT such that v(q@), v(B) 
differ only on the cuts separating @ and f. 

Note that there may be vertices of J which are not in the image of v. 
Each directed edge e of X will give a finite directed path in T consisting 
of those cuts A € € for which re € A and e € 6A. A ray in X will 
correspond to a path in T by concatenating the paths for each edge. It 
may be the case that this path may backtrack. It will determine a ray in 
T unless it visits a particular vertex of T infinitely many times, in this 
case. v maps the end containing the ray to the vertex visited infinitely 
often. Note that because of Proposition 2.4 it cannot visit two distinct 
vertices infinitely many times. Thus we can extend v so that it is a map 
vi: VX NQX > VTOQT. 

If € c BX satisfies the above conditions, then there is a tree T(E). If 
G is the automorphism group of X and € is a G-set, then T which is a 
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G-tree, is called a structure tree for X. If T = T(E) is a structure tree 
for X, then the map v: VX — VT defined above is a G-map. 

We now show that a structure tree determines a decomposition of the 
graph X, in the same way that a group G acting on a tree determines a 
decomposition of the group G. 

Let T = T(E) be a structure tree for X corresponding to a nested set 
of tight cuts € satisfying the finite interval condition (iii). Let v € VT. 

Let v: VX — VT be the map defined earlier. We define a graph Xy 
as follows. We take EXy C EX to be the set of edges which lie in 6C' for 
some edge C of T incident with v, together with those edges e such that 
v maps both vertices of e to v. We take VX, to be the set of vertices of 
these edges, but we identify vertices x, y if they both lie in C* when C 
has initial vertex v. 

Each vertex x of X for which vz = v is a vertex of VX,. Such a 
vertex is called a v-vertex, but there may be no such vertices. There is 
another vertex of Xy for each cut C' with initial vertex v, and this vertex 
is obtained by identifying all the vertices of 6C’ that are in C*. Such a 
vertex is called a p-vertex. A p-vertex has degree [dC]. 

It is fairly easy to see that Xy is connected. Thus any two vertices of 
X are joined by a path in X. If xz, y are two vertices of X that become 
vertices of Xy after carrying out the identifications just described, then 
the path in X will become a path p in X, if we delete any edges that 
are not in Xy. Here we use the fact that C* is connected, and when p 
enters C* at vertex w it must leave C* at a vertex w’ that is identified 
with w in Xy. 

In a similar way a ray in X, corresponds to a ray in X. If the ray passes 
through a vertex corresponding to the cut C’,, then the two incident edges 
will both lie in 6C. There will be a path in C* joining the corresponding 
vertices before they are identified. For each such vertex that is visited 
by the ray we can add in this path to obtain a ray in X. This ray will 
belong to an end w € QX such that vw = v. 

We regard the graphs X, for each v € VT as the factors in the 
decomposition for X. We now describe how they fit together to give X. 
For each edge e € EX there are only finitely many EF e€ € such that 
e € 6K. These edges form the edges of the geodesic in T joining vu 
and vv where u,v are the vertices of e. Suppose there are k(e) such 
edges. Now form a new graph X’ in which each edge e is subdivided 
into k(e) edges. We can extend v : VX — VT to a map also denoted 
v : VX' + VT which can be extended to a graph morphism. It will 
now be the case that v is surjective. For each cut A € €, there is a cut 
A’ € BX’ in which 6A’ consists of those edges of X’ that are mapped to 
the edge A of ET under the extended morphism v : X’ > T. We have 
that A’N VX = A and |éA’| = |éAl. 

Clearly we have a nested set of cuts €’ that will be the edge set of a 
structure tree T which is isomorphic to JT’, and can be identified with T 
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in a natural way. For v € VT the graph X/, will be a subdivision of the 
graph X,. Each edge of X, that joins two p-vertices is subdivided into 
two edges in which the centre vertex is a v-vertex in X’. 

It is easier to use X’ rather than X to understand the structure of 
the graphs X,. This is because every edge of X’ lies in at most one 
6A for A € €' and if u,v are the vertices of A € ET = ET’, then Xy 
and X, are the only factors in the decomposition of X that contain 
the edge e. If G is a group acting on X, then it will also act on T. For 
v € VT the stabiliser Gy will act on Xy. Two directed edges of Xj, will lie 
in the same G-orbit if and only if they lie in the same Gy-orbit. If u,v 
are distinct vertices of T, then edge sets of X/, and Xj, are disjoint 
unless u.v are adjacent in T in which case the intersection consists of 
the edges that map to the edge with vertices vu and vv in T. It follows 
that if X has finitely many G-orbits, then each X, has finitely many 
Gy-orbits. We have a decomposition of X in which the factors are Xy. 


Lemma 2.7. If A,B are crossing thin cuts, with c(A) = m, c(B) =n, 
then after relabelling A as A* and B as B* if necessary, both AN B*, 
A* 1 B are thin cuts with capacities m,n respectively. 


Proof. We refer to Figure 5.3. Suppose m < n. Suppose A is thin with 
respect to x, y with x € A and B is thin with respect to 2’, y’ with 2’ € B. 
After possible relabelling we can assume a < b, c < d. If a < c, then 
c(AN B) < n and c(A* NB) < n and so B is not thin since one of these 
two corners separates x’ and y’. If c < a, then A is not thin. Hence a = c. 
Ifa < b, then c(ANB) = 2a+f < a+e+f+b =m, and soz € ANB* and 
c(AN B*)=a+t+e+c=mandf=0. Also c(A*NB)=at+e+d=n, 
and 2’ € A* B and so it is thin, and we are done. If a = 6, then 
m=2a+e+f<ate+f+td=nandsob<d.Thusa=b=c<d. 
If e is not 0, then c(AN B) < m,c(A* NM B*) < n and the lemma 
follows easily. If e = 0 and f 4 0, then c(AN B*) < m,c(A*N B) <n 
and the lemma follows if we relabel A as A*. But if e = f = 0, then 
c(AN B) = c(AN B*) = m and c(A* N B) = c(A* N B*) = n. In this 
situation it is not possible that two adjacent corners of A, B are not thin. 
Thus for one pair of opposite corners we have that both corners are thin. 
By relabelling we can assume these corners are AM B* and A*M B and 
the lemma is proved. 


Lemma 2.8. Let A,B,C be cuts. 


(i) Let A,B be not nested and let C be nested with both A and B, 
then C is nested with every corner of A, B. 


(ii) If C is nested with A, then C is nested with two adjacent corners 
of A and B. 


Proof. For (i) by possibly relabelling A as A* and/or B as B* and/or 
C' as C* we can assume either 
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(a) CC Aand Cc Bor 
(b) Cc Aand C* cB. 


If (a) then C C ANB and C is contained in the complement of each 
of the other corners. If (b), then B* C C Cc A, and so A, B are nested, 
which contradicts our hypothesis. 

For (ii) if AC C, then AN BCC and ANB* CC. 


Let C be a set of cuts. Let A be a cut and let M(A,C) be the set of 
cuts in C which are not nested with A. Set w(A,C) = |M(A,C)|. 


Lemma 2.9. Let C be a nested set of tight cuts. Let A be a tight cut 
which is not nested with some B €C. Let u(A) = w(A,C) be the number 
of cuts in C that are not nested with A. then 


u(AN B,C) + w(AN B*,C) < w(A,C). 


Proof. If C €C is nested with A, then it is nested with both A and B 
and so it is nested with AM B and AN B* by Lemma 2.8. If C is not 
nested with A, then it must be nested with one of dN B and AN B*. 
For if, say, C C B then B* Cc C* and so AN B* Cc C*. Thus C is not 
nested with at most one of AN B and AN B* and the lemma follows, 
since B is counted on the right but not on the left. 


Let C,, be the set of thin cuts with capacity n. 


Theorem 2.10. There is a uniquely defined nested set of thin cuts E 
in which En ={E € E|c(E) < n} constructed inductively as follows:- 
€, =Cj. 
If Dn = {A € Cy|W(A, En_1) = O}, then En = En_1 UD, where Di, 
consists of all those cuts D € Dn, satisfying 
(*) D is thin with respect to some u,v € VX UQX and w(D,Dy) 
takes the minimal value among all D € Dn that are thin with 
respect to u,v. 


A cut in Dy satisfying (*) is said to be optimally nested with respect 
to U,V. 


Proof. This is an argument from [4]. Put w(A) = w(A,D,). Let 
A,B € Dy be not nested. Each corner of A,B is nested with every 
e € En_1 by Lemma 2.8. Suppose A is optimally nested with respect to 
x,y and B is optimally nested with respect to 2x’, y’. Here x, y, 2’, y’ are 
elements of VX U QX. Each of 2, y, 2’, y’ determines a corner of A, B. 
There are two possibilities. 


(i) The sets x, y determine opposite corners, and x’, y’ determine the 
other two corners. 

(ii) There is a pair of opposite corners such that one corner is 
determined by one of x,y and the opposite corner is determined 
by one of 2’, y’. 
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In case (i) A and B separate both pairs x, y and z’, y’. Because A, B are 
optimally nested with respect to x,y and 2’, y’, we have w(A) = w(B). 
But now both AN B and A* NM B* separate x,y say and c(AN B) = 
c(A* N B*) = n by Lemma 2.7 so that both the corners are in Dy, and 
M(ANB)+u(A*N B*) < w(A)+u(B) = 2u(A), by Lemma 2.8, since if an 
element of Cy, is not nested with both AN B and A*N B* it is not nested 
with both A and B and if it is not nested with one of AN B, A*N B* then 
it is not nested with one of A and B. The strict equality follows because 
A€ Dn separates x, y and is not nested with B but both AN B, A* nN B* 
are nested with A and B. Since both AN B, A* MN B* separate x and y 
we have a contradiction. 

In case (ii) suppose these corners are AM B and A* | B*, and 
that c € AN B,y’ € A* NM B*. But then AN B separates x and y and 
A*(B* separates x’ and y’. Given that A is optimally nested with respect 
to x and y we have w(AN B) > w(A) and because B is optimally nested 
with respect to x’ and y’ we have w(A*M B*) > u(B). But it follows, as 
in the previous paragraph, that w(AM B) + w(A*N B*) < w(A)+ u(B) 
and so we have a contradiction. Thus D/, is a nested set and the proof 
is complete. 

Note that € is uniquely defined, as no choices are made in its 
construction. This is very important in applications. It means that € 
is invariant under the automorphism group of the graph. 


Recall that BX is the Boolean ring of all cuts. Let By, X be the ring 
generated by all cuts A such that c(A) < n. 


Theorem 2.11. For every u,v € VX UQX that can be separated by 
a cut, E contains a cut A that is thin with respect to u,v. The set En 
generates By, X and is the directed edge set of a tree. 


Proof. To prove the first statement we need to show that for every u, v 
there is a cut A that is thin with respect to u,v that is nested with every 
cut EF € E,_1, where n = c(A), i.e. we need to show that there is a cut 
in D,, that is thin with respect to u,v. We know that there is a cut B 
that is thin with respect to u, v. Let k = uw(B,Ep_1). If k = 0, then we 
take A = B. If k > 1, then let C € E€n_1 cross B. We know from Lemma 
2.7 that one of the corners (say BN C) of B, C is thin with respect 
to u,v. By Lemma 2.9 we have u(BN C)+ u(BN C*) < k. Thus if 
u(B) = k > 0 we can find a cut BN C which is thin with respect to u,v 
for which u(BNM C) < k. Thus there must be a cut A which is thin with 
respect to u,v for which w(A) = 0. 

Let By be the subring of 6 generated by €,. Clearly By C By, is the 
subring generated by all cuts with capacity at most n. We want to show 
that B, = B,. Let A be a cut with c(A) = n. We show that A € By, 
by induction on (A) = w(A, En). Suppose that w(A) = 0 so that A is 
nested with every cut in Ey. In this case if A ¢ Ey then A determines a 
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vertex uv of T = T(E,). Thus we define v : En > Zo, vE = 0,if EC A, 
or EC A*,vE =1,if E* C A,or E* C A*. 

We have said that tH = v if FE C A and E is a maximal element of 
En with this property. 

Let X, be the graph defined earlier. 

The cut A will become a cut Ay in X,. Thus Ay will consist of all 
v-vertices x € A such that vz = v and also and also all p-vertices 
y corresponding to cuts C’ € €, such that C* Cc A. It will then be 
the case that 6A = d5Ay. Now A € B,, if and only if either Ay, or At 
consists of finitely many -vertices. If this is not the case, then both Ay 
and A*, either contain infinitely many p-vertices or at least one v-vertex. 
However if say Ay consists only of infinitely many p-vertices, then Ay is a 
connected subset of X, consisting of infinitely many vertices of bounded 
degree. Such a set must contain the vertices of a ray by Konig’s Lemma. 
It follows that if A is not in By, then it must separate two elements of 
VX UQX. But these two elements must be separated by an element of 
En which is not the case, and so we have a contradiction. Thus Ay or 
A* consists of finitely many p-vertices and so A € Bp. 

If u(A) > O and E € Ey is not nested with A, then A = ANE+ ANE", 
and both AN F and AN E* are in By, by induction on k = w(A). The 
theorem is proved. 


If every pair z,y € VX UQX that can be separated by a cut in En, 
then BX = B,X. If BX = B,X and va = vy = v, then z,y cannot 
be separated in X or X,. This means that either both x,y and every 
other vertex of Xy have infinite degree and X, has one end or « = y 
and Xy contains at most one v-vertex. If v is not in the image of v, and 
BX = B,X, then X, will be a one-ended graph in which each vertex 
has degree at most n. 

Thomassen and Woess [32] define a graph to be accessible if there is 
an integer n such that any two ends can be separated by removing at 
most n edges. Alternative ways of defining accessibility are suggested by 
Theorem 2.11. 


Definition 2.12. A graph X is said to be B-accessible if BX = By, X 
for some n. A graph X is said to be €-accessible if € satisfies the finite 
interval condition (iii) of Theorem 2.6. 


A graph X is 6-accessible if and only every pair 2, y €¢ VX UQX that 
can be separated by a cut can be separated by a cut in €,. There is then 
a structure tree J, anda map v: VX UQX > VT;,,U QT» such that 
vz # vy if and only if x, y are not separated by any cut. 

A graph X is €-accessible if and only if € is the directed edge set of 
a structure tree T for which there is a map v: VX UQX — VTUQT 
in which vz = vy if and only if x, y are not separated by any cut A. If 
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X is €-accessible, then for every v € VT the graph X, has the following 
structure: 


Xy has at most one end. Every vertex of infinite degree is a v vertex. 
The vertices of finite degree consist of p-vertices and at most one 
v-vertex. 


Clearly it follows from Theorem 2.14 that X is accessible if BX = B,X 
for some n. If every vertex of X has bounded degree, then BX = By, X 
for some n if and only if X is €-accessible, and all three definitions of 
accessible are equivalent. Wall [34] defined a finitely generated group to 
be accessible if any process of successively splitting the group over finite 
subgroups eventually terminates with factors that are finite or one-ended. 
By Bass-Serre theory this is equivalent to saying that the group has an 
action on a tree with every edge group finite and every vertex group is 
finite or one-ended. 

As proved in [32] a finitely generated group is accessible if and only 
if its Cayley graph (with respect to a finite generating set) is accessible. 
I proved in [8] that finitely presented groups are accessible and in [9] 
I gave an example of a finitely generated group that is not accessible. 
Thus there are vertex transitive locally finite graphs that are not 
accessible. 

It is fairly easy to construct graphs that are €-accessible but not 
B-accessible or accessible. A graph that is B-accessible is both accessible 
and €-accessible. 


Theorem 2.13. A locally finite accessible graph is E-accessible. 


Proof. Suppose that any two ends of X are separated by an n-cut, so 
that X is accessible. Let T = Ty, be the structure tree with edge set 
En. For each v € VT, Xy has at most one end. We have to show that 
any edge of X lies in finitely many 5A for A € €. We know that any 
edge of X‘ lies in at most two graphs X;, and so any edge of X lies in 
finitely many X,. It therefore suffices to show that any edge of Xy lies 
in finitely 5A for A € €. Such an A can be regarded as a cut in X,. Thus 
it suffices to prove the theorem when X has at most one end. Clearly 
the result is true if X is finite. By Konig’s lemma, if X is infinite then 
it has a ray and so in our case it has one end. There will be an infinite 
sequence of elements A; € € such that Aji; C A; whose intersection is 
empty. Because X has one end Aj is finite. If Aj € Eni), then the edges 
in Tra) that separate vertices of A¥ form a finite subtree and this will 
be a subtree of every tree Ty, for m > n(i). It follows that every edge 
of X lies in finitely many 6A for A € E. 


I think it ought to be possible to drop the locally finite condition in 
the last theorem. 

In [11] I showed that a vertex transitive locally finite planar graph is 
accessible. A locally finite graph X has a Freudenthal compactification 
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F(X) in which distinct ends correspond to distinct points. Richter and 
Thomassen [27] show that if X is connected with a locally finite planar 
graph then F(X) can be embedded in S?, and this embedding has a 
uniqueness property if X is three-connected. If A is a tight cut in such 
a graph X, then there is a simple closed curve in S? that intersects 
F(X) in a finite set of points consisting of a single point in each edge of 
5A. The set € will correspond to a set of non-intersecting simple closed 
curves. The graph X will be €-accessible if and only if every edge of X 
intersects finitely many of the simple closed curves. The structure tree 
corresponding to € will then be the dual graph to the set of curves. 

It would be interesting to know if every planar graph is €-accessible. 
If this was the case, then a planar graph of bounded vertex degree 
would be B-accessible. A locally finite €-accessible graph has a unique 
decomposition in which the factors are one ended or finite. 

Combining the two previous theorems we have the following: 


Theorem 2.14. Let N(X) be a network in which X is an arbitrary 
connected graph. For each n > 0, there is a network N(Ty) based on 
a tree T = Ty anda mapv: VX VUQX > VT UQT, such that 
v(VX) Cc VT and va = vy for any z,y € VX UQX if and only if x,y 
are not separated by a cut A with c(A) < n. 

The network N(Ty) is canonically determined and is invariant under 
the automorphism group of N(X). 


For a finite network Theorem 2.14 reduces to Theorem 2.2. For a finite 
network the structure tree of Theorem 2.2 will become a Gomory-Hu 
tree by contracting certain edges. Thus a Gomory-Hu tree has the same 
properties as our tree except that the map v: VX — VT is a bijection. 
One obtains a Gomory-Hu tree from our structure tree by choosing for 
each vertex v € VT that is not in the image of v an incident edge 
of maximal capacity and then contracting all those edges. If we choose 
the edges to contract in the way just described one will preserve the 
property that for any s,t € VX, there is a minimal cut separating s, t in 
the geodesic joining vs, vt in T. Note that while the tree of Theorem 2.2 
is uniquely determined there may be more than one Gomory-Hu tree. 
This has already been noted in the tree of Figure 5.2. 

Let n be the smallest capacity of a cut in N(X). It can be seen fairly 
easily from Figure 5.3 that if A,B are cuts with c(A) = c(B) = n 
and A is not nested with B, then n = 2m and 6A partitions 6A = 
(6(AN B)N 6A) Ud(AN B*) NSA), where each of 6(AN B) NSA and 
6(AN B*) 6A, contain m edges. If C is also a cut with capacity n that 
is not nested with A, then one can show that the partition of 6A given 
by C is the same as that corresponding to B. This result is crucial in the 
cactus representation of mincuts by Dinits, Karzanov and Lomonosov [5]. 
A cactus is a simple graph in which every edge lies in at most one cycle. 
In the cactus representation there is a cactus K and a mincut in N(X) 
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Figure 5.3 Crossing cuts 


Figure 5.4 Cutting up a ladder 


corresponds to a tight cut in K with capacity at most 2. In our notation 
the mincuts in N(X) are the elements of Cj, = Dy and the elements of 
D}, = En correspond to the tight cuts & in K in which 6E has one edge 
or it consists of adjacent edges of a cycle. 

Evangelidou and Papasoglu [19] use a similar cactus argument but 
for minimal cuts separating ends of a graph to give a proof of Stallings 
theorem. 

We illustrate our results with some simple examples. 

In the example of Figure 5.4, the graph X is an infinite ladder, and 
every edge has capacity one. The two ends of X and some vertices are 
separated in T> and all ends and vertices in 73. The vertices of 73 on 
the horizontal central line are not in the image of v. Note the cuts A, A* 
given by the bolder dashed line are not thin with respect to the two 
ends of the graph since c(A) = 3 and the two ends are separated by a 
cut with capacity 2. However, A is thin with respect to the two vertices 
u,v of one rung of the ladder. However, A is not optimally nested with 
respect to u,v since it is not nested with the cut aA where alpha is the 
automorphism swapping the two sides of the ladder. The cuts given by 
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Figure 5.5 Cutting up another ladder 


the blue black dashed lines are nested with every thin cut and so are 
optimally nested with respect to any pair of vertices that they separate. 

In the example of Figure 5.5, all the vertices are separated in Ty and 
all ends and vertices in T5. There is a vertex of infinite degree in each 
of T3 and Ty. 

A finitely generated group G is said to have more than one end if a 
Cayley graph X = X(G, S$) of G corresponding to a finite generating set 
S has more than one end. 


Theorem 2.15 (Stallings theorem). If G is a finitely generated group 
with more than one end, then G has a non-trivial action on a tree T 
with finite edge stabilizers. 


Proof. Let n be the smallest integer for which there is a cut A such that 
|5A| = n which separates two ends s,t. The structure tree T = Tp, will 
have the required property. Here we use the fact that the construction 
of T is canonical and so is invariant under the action of automorphisms. 
Thus the action of G on X gives an action on T. Each edge of T is a 
cut C with |6C| < n. The stabilizer of C will permute the edges of 6C 
and will therefore be finite. We also have to show that action is non- 
trivial. We know there is a cut D in ET that separates a pair of ends. 
For such a cut both D and D* are infinite. The action of G on X is 
vertex transitive. There exists g € G such that the vertices of giD are 
contained in D and an element h € G such that the vertices of héD 
are contained in D*. It then follows that for = g,z = h or x = gh we 
have xD is a proper subset of D or D is a proper subset of 2D. It follows 
from elementary Bass-Serre theory that « cannot fix a vertex of T. 
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This proof is essentially that of [21]. 

In any tree T if p is a vertex and Q is a set of unoriented edges, then 
there is a unique set of vertices P such that v € P then the geodesic 
[v, p] contains an odd number of edges from Q. We then have 6P = Q. 
Note that BT = B,T and every element of BT is uniquely determined 
by the set Q together with the information for a fixed p € VT whether 
p € Aor p € A*. The vertex p induces an orientation Op on the set of 
pairs {e, €} of oriented edges by requiring that e € O if e points at p. For 
Aé BT, A is uniquely determined by 6A together with the orientation 
O, N6A of the edges of 5A. 

In X it is the case that a cut A is uniquely determined by 6A together 
with the information for a fixed p € VX whether p € A or pé A*. 

Since B,X is generated by En, = ETp, the cut A can be expressed 
in terms of a finite set of oriented edges of T,,. This set is not usually 
uniquely determined. Thus if v is not surjective, and v is not in the 
image of v, and the set of edges incident with v is finite, then VX is 
the union of these elements in BX. The empty set is the intersection 
of the complements of these sets. Orienting the edges incident with v 
towards v gives the empty set and orienting them away from v gives 
all of VX. However there is a canonical way of expressing an element of 
B,X in terms of the generating set En. To see this let A € By X —Byn_1X. 
There are only finitely many C € €, with which C is not nested. This 
number is w(A,Epn) = (A). We use induction on (A). Our induction 
hypothesis is that there is a canonically defined way of expressing A in 
terms of the €,,. Any two ways of expressing A in terms of €, differ by an 
expression which gives the empty set in terms of €,. Such an expression 
will correspond to a finite set of vertices, each of which has finite degree 
in Ty, and none of which is in the image of v. The canonical expression 
is obtained if there is a unique way of saying whether or not each such 
vertex is in the expression for A. Thus the canonical expression for A is 
determined by a set of vertices of VT which consists of the vertices of 
v(A) together with a recipe for deciding for each vertex which is not in 
the image of v whether it is in the expression for A. 

Suppose 44(A) = 0, so that A is nested with every C € En, and neither 
A nor A* is empty. If A € En, then this gives an obvious way of expressing 
A in terms of the €,. If A is not in En, then it corresponds to a unique 
vertex z € VT. Thus because (A) = 0, A induces an orientation of the 
edges of En. To see this, let C € En, then just one of C C A, C* C A, 
C Cc A*, C* Cc A* holds. From each pair C, C* we can choose C if C Cc A 
or CC A* and we choose C* if C* C A or C* Cc A*. Let O be this subset 
of €. Then if Ce O and De € and DC C, then D € O. This means 
that the orientation O determines a vertex z in VT),. Intuitively the 
edges of O point at the vertex z. It can be seen that A or A* will be the 
union of finitely many edges EF of E€, = ETp, all of which have tE = z. 
If A is such a union, then we use this to express A = Cy U Co,---U Cy. 
If A is not such a union, but A* = Cy U Cy,---U Cy, then we write 
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A=(C,U Cy,---U Cy)* = CEN CFN---N Ch. Note that this gives a 
unique way of expressing cuts corresponding to a vertex z of finite degree 
not in the image of v. The vertex z is included in the expression for A* if 
and only if only finitely many cuts in €,, incident with z and pointing at 
z are subsets of A. Suppose then that the hypothesis is true for elements 
Be B,X for which u(B) < p(A). Let C € Ey be not nested with A. 
Then u(AN C)+ w(AN C*) < p(A). Thus each of AN C and AN C* 
can be expressed in a unique way in terms of the €,,. If at most one of 
these expressions involves C’, then we take the expression for A to be 
the union of the two expressions for AN C and AN C%*. If both of the 
expressions involve C’, then we take the expression for A to be the union 
of the two expression with C' deleted. The expression obtained for A is 
independent of the choice of C’. In fact the decomposition will involve 
precisely those C’ for which C’ occurs in just one of the decompositions 
for AN C and AN C*. We therefore have a canonical decomposition 
for A. To further clarify this proof observe the following. The edges C, 
which are not nested with A, form the edge set of a finite subtree F’ 
of T,. If EF #4 @ we can choose C' so that it is a twig of F, ie. 
so that one vertex z of F is only incident with a single edge C' of F. 
By relabelling C as C™ if necessary we can assume that “(AN C) = 0. 
The vertex determined by AN C as above is z, and we have spelled 
out the recipe for if this vertex is to be included in the expression 
for A. The induction hypothesis gives us a canonical expression for 
ANC, which together with the expression for AN C' gives the expression 
for A. 


2.3. Flows in Networks 


In this subsection we give a version of the Max-Flow Min-Cut theorem 
for arbitrary networks that reduces to the usual theorem for a finite 
network. Let X,N be as before. For s,t € VX UQX an (s,t)-flow in 
N isamap f: EX — {0,1,2,...} together with an assignment of a 
direction to each edge e for which f(e) 4 0 so that its vertices are ve 
and te and the following holds. 


(i) For each v € VX there are only finitely many incident edges e for 
which f(e) £0. 

(ii) If fT@) = U(f(e)|te = v) and f7(v) = U(f(e)|te = v), then 
f*(v) =f7() for every v 8, t. 

(iii) For every cut A that does not separate s,t if we put fT(A) = 
XU(f(e)le € 6A,te € A) and f(A) = U(f(e)le € bA,te € A*), 
then we have ft (A) = f~(A). That is, for every cut that does not 
separate s,t, the flow into the cut is the same as the flow out. 


Proposition 2.16. For any (s, t)-flow and any cut A such that s € A, 
t € A*, the value ft(A) — f~(A) does not depend on A. This value is 
denoted |f |. 
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Proof. Let A,B be cuts separating s,t. Because AN B also separates 
s,t, it suffices to prove that f*(A) — f~(A) = ft(B) — f7(B) when 
AC B. Let e € 5A. Either e € 5B or e € 86(BN A*). If e’ € SB is not 
in 6A then e’ € 6(BN A* and 6(BN A*) partitions into those edges with 
both vertices in A and those with both vertices not in A. Because A*1 B 
does not separate s, t, fT (A*N.B) = f~(A*NB) and the value of ft —f7 
on the edges of 6(A* MN B) that are in A is minus the value on the edges 
not in A. The symmetric difference of 6A and 6B consists of the edges 
in 6(A* QB) and it follows that ft(A) — f-(A) = ft (B) — f7(B). 


Let T = Ty, be the structure tree for N(X). We have a network N(T) 
based on T in which each edge has capacity at most n. If u,v € VT 
there is a unique maximal (u, v)-flow f(u,v) in which |f(u, v)| is the 
minimal capacity of an edge in the geodesic path [u, v] joining u,v. An 
(s, t)-flow f in N such that |f| < n, induces a (vs, vt)-flow f in T such 
that |f| = If. In T an end corresponds to a set of rays. For any vertex 
of T there is a unique ray starting at that vertex and belonging to the 
end. If ue VT and v € QT, then there is a unique ray starting at u and 
representing v. While if u,v € QT, then there is a unique two-ended 
path in T whose ends represent u and v. In each case we get a (u, v)- 
flow in T by assigning a constant value on the directed edges of the 
path, provided this constant is less than or equal to the capacity of each 
edge in the path. Every (u, v)-flow is of this type. A maximal (wu, v)-flow 
f(u, v) is obtained by taking this constant to be the minimal capacity of 
an edge in the path. 


Theorem 2.17 (MFMC). Let N be a network based on a graph X . Let 
s,t € VX UQX. The maximum value of an (s, t)-flow is the minimal 
capacity of a cut separating s and t. 


Proof. Let n be the minimal capacity of a cut in X separating s, t. 
In the structure tree T = T, there is a flow from vs to vt with the 
property that the value of the flow is n. We have to show that each such 
flow corresponds to a flow in X. 

If s,t € VX, then this follows from the usual proof of the theorem, 
which we repeat here. Suppose we have an (s,t)-flow f in N. Let 
€1, €2,-..e, be a path p joining s and ¢ with the following property. 
Each edge e; is given an orientation in the flow f. This orientation will 
not usually be the same as that of going from s to t. We say that p is 
an f-augmenting path if for each e; for which ve is s or a vertex of ej-1 
we have f(e;) < c(e;), and for each edge e; for which te = ¢t or te isa 
vertex of e;41 we have f(e;) # 0. For any flow augmenting path p we 
get a new flow f* as follows. 


(i) If e € EX is not in the path p, then f*(e) = f(e). 

(ii) If e is in the path p and f(e) = 0, then orient e so that ve is s or 
a vertex of e;_1, and put f*(e) = 1. Recall that we are assuming 
that c(e) 4 0 for every e € EX and so we have f*(e) < c(e). 
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(iii) If e is in the path p and ¢e is ¢ or a vertex of ej41(so that € = e;41) 
and f(e) #0, then f*(e) = f(e) — 1. 

(iv) If e is in the path p and ve is s or a vertex of e;_1 (so that e = e;), 
then f*(e) = f(e) +1. 


The effect of changing f to f* is to increase the flow along the path p. 
We have |f*| = |f| +1. 

Let Ss C VX be the set of vertices that can be joined to s by a flow 
augmenting path. If ¢ € S-, then we can use the flow augmenting path 
joining s and ¢t to get a new flow f*. We keep repeating this process 
until we obtain a flow f for which Sy does not contain t. But now Sr 
is a cut separating s and t. Also if e € 5S-, then we have ve € Sp and 
f(e) = c(e), since otherwise we can extend the f-augmenting path from 
s to ve to an f-augmenting path to te. Thus |f| = c(S;). But n is the 
minimal capacity of a cut separating s and ¢ and so |f| > n. But also 
|f| must be less than the capacity of any cut separating s and ¢ and so 
|f| =n, and Sy is a minimal cut separating s and t. 

If se VX and t € QX, then we can build up a flow from s to ¢ in the 
following way. Let D be a cut in ET separating s and t, so that s € D 
and c(D) > n. Let Xp be the graph defined as follows. The edge set 
EXp consists of all edges e of X which have at least one vertex in D, 
so that either e € 6D or e has both vertices in D. The vertex set VXp 
consists of the vertices of EXp, except that we identify all such vertices 
that are in D*. Let this vertex be denoted d*. Thus in Xp the edges 
incident with d* are the edges of 5D. Since c(D) > n, then as in the 
previous case there is a flow fp from s to d* such that |fp| = n. Let Xp« 
be the graph defined as for Xp, using D* instead of D. We now have a 
vertex d € VXc« whose incident edges are the edges of 6D. Now choose 
another edge E 4 D, such that E* C D*, separating s,t. Thus s € E. 
Now form a graph X (D*, F) whose edge set consists of those edges that 
have at least one vertex in D* MN EF and whose vertex set is the set of 
vertices of the set of edges except that we identify the vertices that are 
in D and also identify the vertices that are in E*. Thus in X(D*, F) 
there is a vertex d whose incident edges are those of 5D and a vertex 
e* whose incident edges are those of 6E. The flow f already constructed 
takes certain values on the edges of 6D. We can find a (d, e*) flow which 
takes these same values on 6D. This flow together with the original flow 
will give an (s, e*)-flow also denoted f such that |f| = n. We can keep 
on repeating this process and obtain the required (s, t)-flow. 

If s,¢ are both in QX, choose a minimal cut M separating s and f¢, 
so that s € M,t € M*. Let Xs be the graph defined as follows. The 
edge set EX, consists of all edges e of X which have at least one vertex 
in M, so that either e € 6M or e has both vertices in M. The vertex 
set VX, consists of the vertices of EX,, except that we identify all such 
vertices that are in M*. Let this vertex be denoted m:. Thus in X, the 
edges incident with mz are the edges of 6M. If c(M) = n, then by the 
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previous case there is a flow f; from s to m: such that |f,| = n. If we 
carry out a similar construction for M* we obtain a flow ft from ms; to 
t with |f;| = n. We can then piece these flows together to obtain a flow 
in X from s to ¢ with |f| =n. 


The following interesting fact emerges from the above proof in the 
case when s,t € VX. If s,t € VX, the cuts in C € ET, such that 
sé€C,teé C* form a finite totally ordered set. It is the geodesic in ET, 
joining vs and vt. Let D be the smallest minimal cut with this property. 
Then S¢ C D, since for any vertex u € D* there can be no f-augmenting 
path joining s and wu. But this must mean that S¢ = D, since S- € By X 
which is generated by ET,. Although the maximal flow between s, t is 
not usually unique, the smallest minimal cut separating s,¢ is unique. 
The way of obtaining D by successively increasing the flow between s 
and t is obviously not a canonical process, as we choose flow augmenting 
paths to increase the flow. 

One might think that one could use a structure tree approach to reduce 
any question about cuts and flows to one about cuts and flows in a 
structure tree. However, this is not always possible. Thus earlier in this 
section it is shown that a cut A in 6, X has a canonical representation 
in terms of €, and therefore corresponds to a cut A’ in B, T. However 
the capacity of A is not usually the same as the capacity of A’. Thus in 
Figure 5.4 the cut A corresponding to the dashed line has capacity 3. 
This cut is the union of a cut (with capacity 2) and a cut (with capacity 
3) and in 73 corresponds to a cut with capacity 5. 


3. Almost Invariant Sets 


3.1. Relative Structure Trees 


We prove Conjecture 1.3 in the case when G is finitely generated over 
Hf, i.e. G is generated by H US where S is finite. 

First, we explain the strategy of the proof. Suppose that we have a 
non-trivial G-tree T in which every edge orbit contains an edge which 
has an H-finite stabiliser, and suppose there is a vertex o fixed by H. 
Let Ty be an H-subtree of T containing 0 and every edge with H-finite 
stabiliser. The action of H on Ty is a trivial action, since it has a vertex 
fixed by H, and so the orbit space H\Ty is a tree, which might well 
be finite, but must have at least one edge. Our strategy is to show that 
if G is finitely generated over H and there is an H-almost invariant set 
A satisfying AH = A, then we can find a G-tree T with the required 
properties by first deciding what H\Ty must be and then lifting to get 
Ty and then T. 

We show that if G is finitely generated over H, then there is a G-graph 
X of which there is a vertex with stabiliser H and in which a proper 
H-almost invariant set A satisfying AH = A corresponds to a proper 
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set of vertices with H-finite coboundary. It then follows from Theorem 
2.14, that there is a sequence of structure trees for H\X. We choose one 
of these to be H\ Ty, and show that we can lift this to obtain Ty and 
then T itself. 

For example, if G = Hx x L, then there is a G-tree Y with one orbit 
of edges and a vertex o fixed by H, and every edge incident with o 
has H-finite stabiliser. Suppose that K, LZ are such that these are the 
only edges with H-finite stabilisers. Then H\Ty has two vertices and 
one edge. When we lift to Ty we obtain an H-tree of diameter two in 
which the middle vertex 0 has stabiliser H. The tree T is covered by the 
translates of Ty. 

On the other hand, if G = Lx*x H where K is finite, and T is as 
above, then every edge of T is H-finite and so Ty is T regarded as an 
H-tree. The fact that our construction gives a canonical construction for 
H\Ty means that when we lift to Ty and T we will get the unique tree 
that admits the action of G. 

We proceed with our proof. 


Lemma 3.1. The group G is finitely generated over H if and only if 
there is a connected G-graph X with one orbit of vertices, and finitely 
many orbits of edges, and there is a vertex o with stabiliser H. 


Proof. Suppose G is generated by HUS, where S is finite. Let X be the 
graph with VX = {gH|g € G} and in which EX is the set of unordered 
pairs {{gH, gsH},g € G,s € S}. We then have that X is vertex transitive, 
there is a vertex o = H with stabilizer H and G\X is finite. We have to 
show that X is connected. Let C be the component of X containing o. 
Let G’ be the set of those g € G for which gH € C. Clearly G’H = G’ 
and G's = G" for every s € S. Hence G’ = G and C = X. Thus X is 
connected. 

Conversely let X be a connected G-graph and VX = Go where 
Go = H. Suppose EX has finitely many G-orbits, Ge ,, Geo,..., Ger 
where e; has vertices o and g;,0. It is not hard to show that G is generated 
by HU (91, 92,---; Gr}. 


Let A Cc G bea proper H-almost invariant set satisfying AH = A. Let 
G be finitely generated over H, and let X be a G-graph as in the last 
lemma. There is a subset of VX corresponding to A, which is also denoted 
A. For any x € G, A+ Az is H-finite. In particular this is true if s € S. 
This means that 5A is H-finite. Note that neither A nor A* = VX — A is 
H-finite. Thus a proper H-almost invariant set corresponds to a proper 
subset of VX such that 5A is H-finite. 

From the previous section (Theorem 2.10) we know that 6(H\X) has 
a uniquely determined nested set of generators € = €(H\X). For FE € €, 
let E C VX be the set of all v € VX such that Hv € E. Let C bea 
component of E. 
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Lemma 3.2. Forh € H, hC = C orhCNC =@. Also HC = E, 
hbCN6C=6C or hbbCNédC = and H\6C =5E. 


Proof. Let h € H. Then AC is also a component of E, since HC C E. 
Thus either hC = C' or hCN C =. Let K be the stabiliser of C in H. 
If v € C, then hv € C if and only if h e K. Thus K\C injects into 
A\C = E and K\6C injects into 6E. But FE is connected, and so 
the image HC’ is E. It follows that there is a single H-orbit of 
components. 


Let E(H, X) be the set of all such C, and let €,(H, X) be the subset 
of €(H,X) corresponding to those C' for which 6C lies in at most n 
H-orbits. 


Lemma 3.3. The set E(H, X) is a nested set. The set En(H, X) is the 
edge set of an H-tree. 


Proof. Let C,D € €n(H,X). Then HC,HD are in the nested set €. 
Suppose HC Cc HD, then C Cc Dor CN D=4@. It follows easily that 
€(H, X) is nested. It was shown in [7] that a nested set € is the directed 
edge set of a tree if and only if it satisfies the finite interval condition, 
ie. if C,D € € and C C D, then there are only finitely many E' € € such 
that C Cc E C D. Thus we have to show that En(H, X) satisfies the finite 
interval condition. If C C Dand C C E C D where C,E,D € €,,(H, X), 
then HC C HE C AD. But €,(H,X) does satisfy the finite interval 
condition and HC = HE implies C = E. Now let CON D = & and 
suppose that 0 = H € C*fM D*. There are only finitely many E € En 
such that C Cc FE and o € E* or such that D C E* and o € E. Each 
E € En such that CC E C D* has one of these two properties. 


Let T = T(H) be the tree constructed in the last lemma. Let T = 
A\ T. Note that in the above T(H) is the Bass-Serre H-tree associated 
with the quotient graph T(H) = H\T(H) and the graph of groups 
obtained by associating appropriate labels to the edges and vertices of 
this quotient graph (which is a tree). Clearly the action of H on T(#) is 
a trivial action in that H fixes the vertex 0 = vo. The stabilisers of edges 
or vertices on a path or ray beginning at o will form a non-increasing 
sequence of subgroups of H. 

We now adapt the argument of the previous section to show that if 
A c VX is such that 6A lies in at most n H-orbits, then there is a 
canonical way of expressing A in terms of the set €(H, X). In this case 
we have to allow unions of infinitely many elements of the generating 
set. Our induction hypothesis is that if 6A lies in at most n H-orbits, 
then A is canonically expressed in terms of E,(H,X). First note that 
there are only finitely many H-orbits of elements of En, = En(H,X) with 
which A is not nested. This is because if C € Ey is not nested with A 
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and F is a finite connected subgraph of H\X containing Ho and all the 
edges of Hé6A, then H6C must contain an edge of F and there are only 
finitely many elements of €, with this property. We now let (A) be 
the number of H-orbits of elements of En, with which A is not nested. 
If w(A) = 0, then A is nested with every C € En. This then means 
that if neither A nor A* is empty and it is not already in Ep, then A 
determines a vertex z of Ty, and either A or A* is the union (possibly 
infinite) of edges of T, that lie in finitely many H-orbits. If A is such a 
union, then we use this union for our canonical expression for A. If A is 
not such a union, then A* is; we have A* = J{C,|A € A}, where each 
C,, has tC, = z and the edges lie in finitely many H-orbits. We write 
A=(U{G|A € A})* = {CFA € A}. Note that this gives a canonical 
way of expressing cuts corresponding to a vertex that is not in the image 
of v and whose incident edges lie in finitely many H-orbits. Suppose then 
that the hypothesis is true for elements B for which w(B) < p(A). Let 
C € En be not nested with A. Then w(AN HC) + w(AN HC*) < w(A). 
Thus each of AN HC and AN HC* can be expressed in a unique way 
in terms of the €,. We take the expression for A to be the union of the 
two expressions for AN HC and AN HC* except that we include hC for 
h € H, only if just one of the two expressions involves hC’. 

If g € G, then gT(H) is a (gHg~')-tree. It is the tree T(gHg—!) 
obtained from the G-graph X by using the vertex go instead of o. We 
now show that there is a G-tree T which contains all of the trees g T(H). 

We know that the action of the group G on X is vertex transitive and 
that X has a vertex o fixed by H. Also G is generated by H US where 
S is finite. 

Clearly there is an isomorphism ag : T(H) > T(gHg"') in which 
Dw gD. 

Suppose now that vo 4 v(go). Let A,B be H-almost invariant sets 
satisfying AH = A, BH = B and let g € G. We regard A, B as subsets 
of VX, so that 6A and 6B are H-finite. 

Suppose that o € gB* and go € A*. The following lemma is due to 
Kropholler [22], [23]. We put K = gHg7!. 


Lemma 3.4. In this situation 6(AN gB)is (HN K)-finite. 


Proof. Let x € G. We show that the symmetric difference (AN gB)ax + 
(AN gB) is (HN K)-finite. Since A, B are H-almost invariant, there are 
finite sets E,F such that A+ Av C HE and B+ Bar C HF. We then 
have 


(AN gB)a + (AN gB) = Ax (gBr + gB) + (Ar + AYN gB 
= Arf gHF + g(g- HEN B). 


Now Az gHF is K-finite, but it is also H-finite because gH is 
contained in A*, given that go € A*. A set, which is both H-finite and 
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K-finite is HN K-finite. Thus Arn gHF is (H1 K)-finite. Similarly using 
the fact that g~!o € B*, it follows that g-! HEN B is HN(g—!Hg)-finite, 
and so g(g- HE OB) is (HN K)-finite. Thus 4 gB is (HN K)-almost 
invariant. But this means that 6(A/M gB) is (HM K)-finite. 


What this lemma says is that if A,gB are not nested then there is 
a special corner—sometimes called the Kropholler corner—which is 
(HM K)-almost invariant. 

Notice that in the above situation all of 6A,5(AN gB*) and 6(AN gB) 
are H-finite. If we take the canonical decomposition for A, then it can 
be obtained from the canonical decompositions for AN gB and AN gB* 
by taking their union and deleting any edge that lies in both. Also 6(gB) 
is K-finite and the decomposition for gB can be obtained from those for 
gB OA and gB 1 A*. But the edges in the decomposition for AM gB 
which is (HN K)-almost invariant are the same in both decompositions. 

We will now show that it follows from Lemma 3.4 that the set GEn 
is a nested G-set which satisfies the final interval condition, and so it is 
the edge set of a G-tree. We have seen that an is a nested H-set where 
En = H\En is the uniquely determined nested subset of B,(H\X) that 
generates B,(H\X) as an abelian group. It is the edge set of a tree 
Tn(H\X). _ 

If A,B € €, and A,gB are not nested for some g € G, then by 
Lemma 3.4 there is a corner—the Kropholler corner—which we take to 
be ANgB, for which 6(AN gB) is (HN K)-finite. We then have canonical 
decompositions for AN gB and AN gB* as above. This is illustrated 
in Figure 5.6. The labels a,b,c, d,e,f are for sets of edges joining the 
indicated corners. In this case the letters do not represent edges of X but 
elements of En. Although each FE € En comes with a natural direction, 
in the diagram we only count the unoriented edges, i.e. we count the 


number of edge pairs (EZ, E*). In the diagram, AN gB is always taken 
A | A* 
a 
An gB A* 7 gB An gB A* 7 gB 
gB ee 
2k ¢ Pane d 
gB ods 
AN gB* A* 9 gB* AN gB* A* nn gB* 
b 


Figure 5.6 Crossing cuts 
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to be the Kropholler corner. Thus we have that any pair contributing to 
a,f or e must be (1H 1 K)-finite. Any pair contributing to e or b must 
be H-finite and any pair contributing to e or d must be K-finite. 

We have that at+e+f+b=1andc+e+f+d= 1. Suppose that the 
Kropholler corner AN B is not empty. It is the case that each of o and 
go lies in one of the other three corners. We know that o € gB*, go € A”. 
Ifo € ANgB* and go € A*NgB, thena=c=lande=f=b=d=0 
and A*N gB* = %. If o € A*NgB and go € A* 1 gB*, thn a=d=1 
and AM gB* = @, while if both o and go are in A* MN gB*, then either 
a=d=1and An gB* =% ora=c=1 and A*NgB=9 or f =1 and 
both AN gB* and A* 1 gB are empty, so that A = gB. In all cases A, gB 
are nested. 


(a) go 


We need also to show that GEn, satisfies the finite interval condition. 
Let g € G and let K = gHg~!. Consider the union EU gE. This will be 
a nested set. In fact it will be the edge set of a tree that is the union 
of the trees T(H) and T(K). In the diagram the very thick edges are 
the edges that are just in T(H). The thick edges are the ones that are 
in T(K). The other edges are in both T(H) and T(K). An edge is in 
the geodesic joining o and go if and only if it has stabiliser containing 
AK, it will also lie in both T(H) and T(K) if and only if its stabiliser 
contains HM K as a subgroup of finite index. It may be the case that 
T(H) and T(K) have no edges in common. An edge lies in both trees 
if and only if it has a stabiliser that is (H M K)-finite. If there are such 
edges then they will be the edge set of a subtree of both trees. They will 
correspond to the edge set E(H NK). 

It follows that T(#) is always a subtree of a tree constructed from 
a subset of GE, that contains €,. If T(H) and T(K) do have an edge 
in common, then T(H) U T(K) will be a subtree of the tree we are 
constructing. If e €¢ EX has vertices go and ko and there is some C € 
GEn that has e € 5C, then C € gET(g~!Hg) NKET(k—! Hk). If there is 
no such C, i.e. there is no cut C € GEn that separates o and k-1go then 
T(A) = k-'gT(H). As there is a finite path connecting any two vertices 
u, v in X, it can be seen that there are only finitely many edges in GEn 
separating u and v since any such edge must separate the vertices of one 
of the edges in the path. Thus GE, is the edge set of a tree. 
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We say that a G-tree T is reduced if for every e € ET, with vertices 
te and te we have that either ve and te are in the same orbit, or Ge is 
a proper subgroup of both Gie and Gye. 


Theorem 3.5. Let G be a group that is finitely generated over a 
subgroup H. The following are equivalent: 


(i) There is a proper H-almost invariant set A = HAK with left 
stabiliser H and right stabiliser K, such that A and gA are nested 
for everyg eG. 

(ii) There is a reduced G-tree T with vertex v and incident edge e 
such that Gy = K and G. = H. 


Proof. It is shown that (ii) implies (i) in the Introduction. 

Suppose then that we have (i). We will show that there is a G-tree— 
in which G acts on the right—which contains the set V = {Az|x € G} 
as a subset of the vertex set. Let « € G, then A+ Az is a union of 
finitely many cosets Hg1, Hgo,..., Hgx. Then to, A. 95 A, sles g, A} is 
the edge set of a finite tree F'. We know that the set {gAlg € G} is the 
edge set of a G-tree T provided we can show that it satisfies the finite 
interval condition. But this must be the case as the edges separating 
vertices A and Az will be the edges of F. 


Theorem 3.6. Let G be a group and let H be a subgroup, and suppose 
G is finitely generated over H. There is a proper H-almost invariant 
subset A such that A= AH, if and only if there is a non-trivial reduced 
G-tree T in which H fixes a vertex and every edge orbit contains an edge 
with an H-finite edge stabilizer. 


Proof. The only if part of the theorem is proved in Theorem 3.5. In 
fact it is shown there that if G has an action on a tree with the specified 
properties, then there is a proper H almost invariant set A for which 
HAH = A. 

Suppose then that G has an H-almost invariant set A such that 
AH = A. Since G is finitely generated over H, we can construct the 
G-graph X as above, in which A can be regarded as a set of vertices for 
which 6A lies in finitely many H-orbits. Let this number of orbits be n. 
Then we have seen that there is a G-tree T. n for which H fixes a vertex 0 
and every edge is in the same G-orbit as an edge in T(H). The edges in 
this tree are H-finite. The set A has an expression in terms of the edges 
of T(H). Finally we need to show that the action on Ty is non-trivial. 
If G fixes 0, then v(A) consists of the single vertex o and so A is not 
proper. In fact the fact that A is proper ensures that no vertex of Tp is 
fixed by G. 

It can be seen from the above that T(H)N T(g~! Hg) a T(HNgHg"!) 
so that if e € ET(H), and g € Ge, then e € T(gHg—!) and so Ge is 
H-finite. 


The Kropholler conjecture follows immediately from the last theorem. 
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4. H-Almost Stability 


Let G be a group with subgroup H, and let T be a G-tree. 

Let Ac VT be such that 6A C ET consists of finitely many H-orbits 
of edges e such that Ge is H-finite. Also let H fix a vertex of T. Note 
that 5A consists of whole H-orbits, so that e € A implies he € 6A for 
every h € H. The fact that G, is H-finite for e € 6A follows from the 
fact that 6A is H-finite. If He is the stabiliser of e € 5A, then [Ge : He| 
is finite. 

Let v € VT, and let A = A(v) = {g € Glgu € A}. Note that A(av) = 
A(v)x~", so that the left action on T becomes a right action on the sets 
A(v). If x € G and [v, zu] is the geodesic from v to rv, then g € A+ Ar 
if and only the geodesic |gv, grv| contains an odd number of edges in 6A. 
If [v, zu] consists of the edges e1, €2,..., er, then ge; € 6A if and only if 
Hge; € 5A. It follows that H(A + Ar) = A+ Az. It is also clear that 
for each e; there are only finitely many cosets Hg such that Hge; € 6A. 
Thus A is H-almost invariant. We also have A(v)H = A(v) if H fixes v. 

For each e € ET, let d(e) be the number of cosets Hg such that 
Hge € 5A. We see that d(e) = d(zxe) for every z € G and so we have 
a metric on VT, that is invariant under the action of G. We will show 
that if G has an H-almost invariant set such that HAH = A then there 
is a G-tree with a metric corresponding to this set. 

From now on we are interested in the action of G on the set of H- 
almost invariant sets. But note that we are interested in the action by 
right multiplication. The Almost Stability theorem [3], also used the 
action by right multiplication. Let A C G be H-almost invariant and let 
HA = A. For the moment we do not assume that AH = A. 

Let M = {B|B =, A} so that for B, Ce M, B+ C= HF where F is 
finite. 

Note that for H = {1} it follows from the Almost Stability theorem 
that M is the vertex set of a G-tree. 

We define a metric on M. For B,C € M define d(B,C) to be the 
number of H-cosets in B+ C. 

This is a metric on M, given that (B+ C)+(C+ D) = (B+D), and 
so an element which is in B+ D is in just one of B+ C or C+ D. Thus 
d(B,D) < d(B,C)4+ d(C, D). 

Also G acts on M by right multiplication and this action is by 
isometries, since (B+ C)z = Bz+ Cz. Let T be the graph with VT = M 
and two vertices are joined by an edge if they are distance one apart. 
Every edge in I corresponds to a particular H-coset. There are exactly 
n! geodesics joining B and C if d(B,C) = n, since a geodesic will 
correspond to a permutation of the cosets in B + C. The vertices of 
I on such a geodesic form the vertices of an n-cube. 

The edges corresponding to a particular coset Hb disconnect I, 
because removing this set of edges gives two sets of vertices, B and 
B*, where B is the set of those C € M such that Hb c C. 
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It was pointed out to me by Graham Niblo that I is related to the 
1-skeleton of the Sageev cubing introduced in [29]. For completeness we 
describe this connection. 

Let G be a group with subgroup H and let A = HA be an H-almost 
invariant subset. Let 


x = {gAlg € G}U{gA*|g € G}. 


We define a graph I’. A vertex V of I’ is a subset of © satisfying the 
following conditions: 


(1) For all B € IT’, exactly one of B, B* is in V. 
(2) If Be V,CeXand BCC, then Ce V. 


Two vertices are joined by an edge in I” if they differ by one element of 
xX. For g € G, there is a vertex V, consisting of all the elements of & 
that contain g. Then Sageev shows that there is a component [! of I’ 
that contains all the Vg. 

By (1) for each V € & either A € V or A* € V but not both. Let 
Za be the subset of © consisting of those V € ¥ for which A c V. The 
edges joining Y4 and X% in T! form a hyperplane. Each edge in the 
hyperplane joins a pair of vertices that differ only on the set A. For each 
“A there is a hyperplane joining vertices that differ only on the set «A. 
Clearly G acts transitively on the set of hyperplanes. 

With V as above, consider the subset Ay of G 


Ay ={xe€ Gla Ae V}. 


Then HAy = Ay and Ay, = A. Also Ay +A is the union of those cosets 
Hx for which V and V, differ on 2! A, which is finite. Thus Ay € VI. 

Thus there is a map VP! > VI in which V& Ay. This map is an 
injective G-map. 

If the set A is such that A and gA are nested for every g € G, then 
l! is a G-tree. Thus VI contains a G-subset that is the vertex set of a 
G-tree. 

In IT a hyperplane consists of edges joining those vertices that 
differ only by a particular coset Hx. Every edge of I belongs to just 
one hyperplane. The group G acts transitively on hyperplanes. The 
hyperplane corresponding to Hz has stabilizer «~! Ha. 

Suppose now that A is H-almost invariant with HAK = A. Here 4H is 
the left stabiliser and K is the right stabiliser of A, and we assume that 
HT < K,so that in particular HAH = A. Note that it follows from the fact 
that A is H-almost invariant that it is also K almost invariant. Suppose 
that G is finitely generated over K. We saw in the previous section that 
there is a G-tree T in which A uniquely determines a set A of vertices 
with H-finite coboundary 6A. Here T = Ty for n is sufficiently large 
that in the graph X—as defined in the previous section—the set 5A is 
contained in at most n H-orbits of edges. Note that if e is an edge of 
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T(H) = €(H,X), then de is H,-finite, and will consist of finitely many 
H--orbits. It is then the case that [Ge : He| is finite, as de will consist of 
finitely many G,-orbits, each of which is a union of [G. : He] H--orbits 
of edges. 

We also know that K fixes a vertex 6 of T, and that HSA = 6A. Thus 
5A consists of finitely many H-orbits of edges. We can contract any edge 
whose G-orbit does not intersect 5A. We will then have a tree that has 
the properties indicated at the beginning of this section. Thus A c VT 
is such that 5A C ET consists of finitely many H-orbits of edges e such 
that G_. is H-finite. We see that the metric d on M is the same as the 
metric defined on VT. Explicitly we have proved the following theorem 
in the case when G is finitely generated over K. 


Theorem 4.1. Let G be a group with subgroup H and let A= HAK 
where H < K and A is H-almost invariant. Let M be the G-metric space 
defined above. Then there is a G-tree T such that VT is a G-subset of 
M and the metric on M restricts to a geodesic metric on VT. Ife € ET, 
then some edge in the G-orbit of e has H-finite stabiliser. 


This is illustrated in Figures 5.7 and 5.8. 


Proof. It remains to show that the theorem for arbitrary G follows from 
the case when G is finitely generated over K. Thus if F is a finite subset 
of G, then there is a finite convex subgraph C' of I containing AF. 
We can use the graph X of the previous section for the subgroup L of G 
generated by HUF to construct an L-tree which has a subtree S(F’) with 
vertex set contained in VC’. These subtrees have the nice property that 
if Fy C Fo then S(F{) is a subtree of S(f2). They therefore fit together 
nicely to give the required G-tree. We give a more detailed argument for 
why this is the case. We follow the approach of [1]. 


Figure 5.7 Finding a tree in M 
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Figure 5.8 Finding a tree in M 


Let M’ be the subspace of M consisting of the single G-orbit AG. 
Define an inner product on M’ by (B.C), = 5(d(A, B) + d(A, C) - 
d(B, C)). 

This turns M’ into a 0-hyperbolic space, i.e. it satisfies the inequality 


(B.C)4 = min{(B.D)a,(C.D)a} 


for every B, C,D € M’. This is because we know that if L < G is finitely 
generated over H, then there is an [-tree which is a subspace of M. But 
A, B,C, D are vertices of such a subtree which is 0-hyperbolic. It now 
follows from [1], Chapter 2, Theorem 4.4 that there is a unique Z-tree 
VT (up to isometry) containing M’. The subset of VT consisting of 
vertices of degree larger than 2 will be the vertices of a G-tree and can 
be regarded as a G-subset of M containing M’. 
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Abstract 

We will give a criterion for the amenability of arbitrary locally finite 
trees. The criterion is based on the trimming operator, which is defined 
on the space of trees. As an application, we obtain a necessary and 
sufficient condition for that amenability of Galton—Watson trees. 
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Introduction 


The notion of amenability for groups emerged out of von Neumann’s 
effort [vN29] in 1929 to find the underlying reason for Hausdorff’s 
paradox. It is his observation that once the dimension of a Euclidean 
space F' exceeds two, the group of isometries of E will contain a copy of 
the free group on two generators and hence fails to be amenable. This 
copy of the free group can then be used to carry out various paradoxical 
decompositions, which are analogous to Hausdorff’s original result. 
Since its inception, the theory of amenable groups has been explored 
and the notion of amenability has been extended to a broad class of 
algebraic objects. Zimmer [Zim78] found that a non-amenable group 
may have actions that share many properties of the actions of amenable 
groups and thus initiated the theory of amenable group actions. 
Amenability of a discrete group can also be formulated in terms of 
its Cayley graph. Before giving this definition, let us recall the notion 
of the Cayley graph for a (finitely generated) group. Recall that if G is 
a group generated by a (symmetric) set = of generators, then Cayley 
graph T = P(G,%) is an undirected graph with the underlying set of 
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G as its vertex set in which vertices g1,g2 € G form an edge when 
I : g2 € &. For instance, one can readily see that the Cayley graph of 
a free group on & generators with respect to the standard generating 
set is isomorphic to the (unique) 2k-regular tree. For a subset A of 
vertices of [, the boundary @A, by definition, consists of those vertices 
in A that have a neighbor in V(I)\A. A family A, C G is then 
called a Fglner family, if |AAn|/|An| > 0, as n > on. It is a classical 
result that a finitely generated group G is amenable iff such a Folner 
family exists, see [Fol55]. This definition lends itself to using many 
other methods to establish the amenability of a group. For instance, in 
various works by Bartholdi, Kaimanovich, Nekrashevych, Amir, Angel, 
and Virag ([Kai05], [BKN10], [AAV13]) random walks have been used 
to prove the amenability of several self-similar groups. Many definitions 
and statements about amenable groups carry over almost verbatim to 
amenable graphs. For instance, Gerl [Ger88] showed that a connected 
graph is amenable if and only if the spectral radius of simple random 
walk on the graph is strictly less than 1. This result is a generalization 
of Kesten’s result for amenable groups [Kes59]. 

Note that the Cayley graph of a group G is homogeneous, i.e., 
its automorphism group (containing G as the subgroup of “internal 
symmetries”) acts transitively on the vertex set of G. In particular, 
the only trees that can be realized as Cayley graphs are those of free 
products of cyclic group, which, except in trivial cases, are non-amenable. 
In this note, we will take up the question of characterizing amenability 
for arbitrary locally finite trees. Our point of departure is a group of 
results proved by Gerl and Woess who investigated this property for 
trees that do not have degree-one vertices. Recall that a branch is a 
vertex with degree at least three. It is clear that a graph which contains 
arbitrarily long paths without branches is amenable. Gerl [Ger86] proved 
a tree without any leaves with uniformly bounded degrees is amenable 
if and only if there are arbitrarily long paths without any branch as 
induced subgraphs. Later, Woess [Woe00, p. 114] improved this result 
by dropping the uniform finiteness condition. (see Theorem 2.7). One 
can easily see that the assumption that T has not vertices of degree one 
cannot be dropped (see Example 2.8). 

Our first theorem extends this characterization to arbitrary trees and, 
en passant, also supplies a rather elementary and “probability-free” proof 
of that result too. In order to state the theorem, we will need to introduce 
some new terminology. Our central new concept is the trimming operator 
© defined on the space of countable trees. Intuitively, trimming a tree 
amounts to removing all vertices of degree 1. Hence, the set of fixed 
points of this operator are precisely the trees without leaves (but see 
Example 3.3). Using the trimming operator, we will define inessential 
subtrees of a tree which are, roughly speaking, finite subtrees hanging 
from a vertex of the main tree. Next to long paths, inessential subtrees 
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can provide another source of Fglner sets for infinite trees. Our theorem 
roughly says that these are the only underlying reasons for the amenabil- 
ity of a tree: 


Theorem 1. Let T be an infinite tree. Then T is amenable if and only 
if T contains arbitrarily large inessential trees or for some k > 1, @*(T) 
has arbitrarily long paths. Moreover, the former is always the case if T 
can be trimmed indefinitely. 


This theorem can then be applied in a probabilistic setting. There are 
many models for random trees. The oldest and perhaps the most well- 
known one is the family of Galton—Watson trees. In this context, we have 
the following theorem: 


Theorem 2. The Galton—Watson tree T associated to the probability 
distribution (pi)i>o ts almost surely amenable, if and only if P| Xz < 1] = 
po + pi > 0. 


This chapter is organized as follows. In Section 1, we will define the 
graph-theoretical terminology that is freely used throughout the chapter. 
Section 2 is devoted to stating general facts about amenability. A simple 
proof of Theorem 1 in the special case of trees without leaves is also given 
in this section. In Section 3, the trimming operator and inessential trees 
are introduced and studied. Finally, in Section 4, we show an application 
of Theorem 1 in the context of Galton—Watson trees. 
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1. Preliminaries 


In this section, we will define the basic graph-theoretic terminology and 
set the notations used in this chapter. A (undirected) graph G consists 
of a non-empty set V(G) called the vertices and a family of 2-element 
subsets of V(G), called the edges of G, and denoted by E(G). For 
brevity, the edge {u, v} with vertices (also called endpoints) u and v will 
be denoted by uv, hence uv = vu. The set of neighbors of a vertex v, 
denoted by N(v), consists of the vertices u € V(G) with wv € E(G). 
Correspondingly, for a subset A C V(G), we set N(A) = U,e4 N(a). 
The degree of a vertex is given by degv = |N(v)|, where |X| denotes 
the cardinality of set X. All the graphs considered in this chapter are 
assumed to be locally finite, that is, degv < oo for all v € V(G). A 
leaf is a vertex of degree 1. The set of leaves of a graph G is denoted 
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by L(G). For two leaves u,v € G, we write u~ v if N(u) = N(v). 
A branch is a vertex with degree strictly more than 2. For a non- 
empty subset A C V(G), the induced subgraph G[A] is the graph with 
V(G[A]) = A and E(G[A]) = {uv € E(G) : u,v € A}. Similarly, for 
a non-empty subset R of edges of G, the edge-induced subgraph G(R) 
has R as the set of edges and the set of endpoints of R as the vertices. 
A path of length n between two vertices u and v is a finite sequence 
of vertices 7% = U,%1,...,%m = v such that 2; and 241 are neighbors 
for i= 0,1,---n—1. A tree T is a graph such that for any two vertices 
u,v € V(T), there exists a unique path joining u to v. This path (viewed 
as in induced subgraph of T) will be denoted by [wv]. We say that a graph 
G contains arbitrarily long paths without branches if for any n, there 
exist vertices vj,...,Un € V(G) such that v; is connected to vuj+1 for 
1<i<nand for 1 <i <n, the degree of v; in G is exactly 2. 


2. Amenability of Graphs 


In this section, we recall the definition of amenability for graphs and 
groups. Moreover, we will give a complete characterization of amenable 
trees without leaves. 


Definition 2.1. Let A be a subgraph of graph G. The boundary of A 
consists of those vertices of A which are connected to at least one vertex 
outside A. We will denoted the boundary by 0A. Hence, 


dA={ve V(A) : we E(G), for some u € V(A%}, 
where |A| = |V(A)|. 
We will also need the following definition. 


Definition 2.2. Define the isoperimetric number or Cheeger constant 
of a graph G as follows 
|0.A| 
| A| 

The graph G is called amenable if i(G) = 0. 

Equivalently, the graph G is amenable if and only if there is a sequence 
of finite subgraphs (Ay)n>1 of G such that 

J9An| 


1 
n—0o |A 


i(G) = int | : Ais a non-empty and finite subgraph of a 


nl 

Such a sequence (A,,)n>1 witnessing the amenability is called a the 
Fglner set. Let us make two remarks about the Folner set: first, one can 
always exchange a Fglner set with one that consists of finite connected 
graphs. This follows from the following easy lemma: 
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Lemma 2.3. Jf A is a finite subgraph of G, such that |dA| < €|Al, 
then there is connected subgraph B of A, such that |AB| < «|B. 


Proof. Let Aj,,A2,---,An are finite connected components of A. 
Because A;js are pairwise disjoint, we have |dA| = |0A1| +-+ |dAn|. 
Therefore, |0A,| +--+ |dA,y| = JOA] < €[A] = €JAly +--- + €[Anl. 
Hence, there exists 1 < i < n such that |dA;| < €|A;|. 


Second, the Felner set can be chosen to exhaust V(G): 


Proposition 2.4. Let (An)n>1 be a Folner sequence for the graph G. 
Then there exists Folner sequence (A),)n>1 such that Uns A, = G.- 


Proof. Let By, be a sequence of finite subgraphs such that their union is 
the whole graph G. By induction, define Aj, = B,UAx,,, where \/|Az,,| = 
|Bn|. Then 
J9An| — |Bni + 10 Ax, | 
[An] [Ai | 


as n goes to infinity. 


We can now give the definition of amenability for countable groups. 


Definition 2.5. A countable group G is called amenable whenever its 
Cayley graph admits a Folner set, i.e., there exists a sequence of finite 
subsets (An)n>1 such that 
AnAA 

fim RE 2 

w |An| 
for every g € G, where gAn = {ga : a € An} and AnAgAn is the 
symmetric difference of two sets A, and gAn. 


Let G be a finitely generated group with a symmetric finite set X. 
The Cayley graph of the group G is a graph whose vertices are elements 
of G and two vertices of g; and g2 are connected if there is an element 
s € DY such that gis = go. A finite generated group G is thus amenable 
if and only if its Cayley graph is amenable. 


Proposition 2.6. Let T be a tree which does not have any vertex with 
degree less than 3. Then for every finite subtree A, we have |A| < 2|d Al. 
Consequently, T is not amenable. 


Proof. Let A bea finite subtree of T. Define Kj = {u € A : degu = 7}. 
Hence, the boundary of A at least includes the vertices with degree 1 
and 2, hence |dA| > |A | + | K2|. It is well known that 


2|E(A)| = se deg(v) =) il Kil. (6.1) 
ve V(G) w>1 
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On the other hand 
|V(A)| = 0 1KGI. (6.2) 
i>1 
Since A is a tree, |E(G)| = |V(A)| — 1. Equalities (6.1) and (6.2) now 
imply 
|Kil = }o( — 2)| Kil + 2. 
1=3 


We now have 2|9.A| > |Ki| + |Ko| + Djs3(é — 2)|Kil = jor |Kil- 


As mentioned, Woess [Woe00, p. 114] classified amenable trees without 
any leaves. Here we provide an alternative proof for this result. 


Theorem 2.7. [Woe00, p. 114] Let T be an infinite tree with no leaves. 
Then T is amenable if and only if T contains arbitrarily long paths 
without any branch. 


Proof. Let T contain arbitrary long paths without branches. For each 
n, let An be a finite subtree of T with exactly n vertices with degree 2. 
Hence, |An| = n+ 1 and |dAp| < 2. Consequently, (An) is a Felner set. 
Assume T does not contain arbitrarily long paths without branches. 
Let T’ be a tree obtained after removing all vertices of T whose degrees 
are 2. Because T does not have any leaves, by Proposition 2.6, T’ is not 
amenable. If A is a finite subtree of 7, then corresponding subtree A’ 
in T’ has the same boundary as A’ and clearly |A’| < |A|. In addition, 
each edge of A’ is obtained by removing at most d vertices, where d is 
the longest path without any branch. In other words, |.A’| < d|A|, and 


|aA’| aA] _ aA’ 
= eer 
|A") ~ Al |A"| 


(6.3) 


Combining the preceding inequalities and the fact T’ is not amenable 
imply non-amenability of T. 

The preceding theorem is not true if the trees are allowed to have 
infinitely many leaves, as the following example shows. 


Example 2.8. Let T be a tree which can be obtained by attaching 
one vertex and one edge to the Cayley graph of Z with respect to the 
generating set {—1, 1} (see, Figure 6.1). Then T is amenable, but T does 
not contain arbitrarily long paths without any branch. 


Remark 2.9. One has to note that some properties of amenability for 
graphs may diverge from amenability of groups. For instance, it is known 
that every subgroup of a (discrete) amenable group is amenable. The 
analogous property does not hold for trees. This can be easily seen as 
follows: let T be a 3-regular tree and J” be the tree obtained by adding 
an infinite ray Z (the graph with vertices 1,2,... where i and j are 
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Figure 6.1 Example 2.8 


Figure 6.2 Example 2.8 after trimming 


adjacent when |i — j| = 1) to one of the vertices of T. In other words, 
let T’ be the graph obtained by taking the disjoint union of T and Z 
and identifying vertex 1 of Z with one of the vertices of T. Clearly T’ 
contains arbitrary long paths without any branch and is hence amenable, 
but it contains T as a subtree which is not amenable. One can modify 
this example to construct an example of a tree T’ and a subtree T with 
the same set of ends as T such that T’ is amenable, but T is not. 


3. Trimming and Inessential Subtrees 


In this section, we will define certain operators on the space of trees. 
These definitions will later be used to give a criterion for amenability of 
trees. 


Definition 3.1. Let J be an infinite tree. The trimming operator 
©(T) is defined by 
O(T) = T|V(T)\L(T)]. 


In other words, @(T) is the tree obtained by removing all the leaves of 
T together with their incident edges. 


Remark 3.2. We will always view ©(T) as an induced subtree of T. 
For instance, T has no leaves iff ©(7) = T. Note that 


TD O(T)D@7(T)D-:- 
is a decreasing sequence of subtrees of T. Also define, @°(T) = T and 
e*+1(T) = @(@*(T)), for k > 0. 
We say that the trimming stops in finite time if this sequence stabilizes 
at some point, ie., if there exists k > 0 such that O*(T) does not 


have any leaves, which is equivalent to @'(T) = O*(T) for all 1 > k. 
Otherwise, we say that T can be trimmed indefinitely. 


Example 3.3. Although © is defined on the space of trees, it also 
induces a map on the space 7 of isomorphism classes of trees. Let us 
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Figure 6.3 Example 3.3 


denote the isomorphism class of a tree T’ by [T]. We can now study the 
dynamics of © on J and pose various questions about it. For instance, 
finite trees [ T] are exactly those trees whose orbit contains the empty tree 
(the tree with one vertex and no edge). For a more interesting example, 
let T be the tree with V(T) = {(i,j) € Z* : 0 <j < 4}, and the 
edges between (7,0), (¢+1,0) for 2 > 0 and (4,7) and (2,7 +1) for every 
i> 1land0<j<i-—1, see Figure 6.3. It is easy to see that T can 
be trimmed indefinitely, while ©(T) is isomorphic to T. In other words, 
[T] is a fixed point for ©. Clearly, if L(T) = 4, then O(T) = T. It is an 
interesting question to characterize those trees with L(T) 4 @, for which 
©([T]) =[T]. One can analogously define for any n > 1 the tree T;, by 


V(Tn) = {G,j) € Z? 20 <j < ni}, 


with the edges similar to those of T above. In this case, one can see that 
[Tp] is a periodic point of © with the smallest period n. 


We will now define the notion of inessential subtree and use it to prove 
some of the properties of finitely trimmable trees. 


Definition 3.4. Let 70 be a finite (connected) subtree of an infinite 
tree T with at least one edge. We say that To is inessential if the edge- 
induced subgraph G(E(T)\E(Jo)) is connected, and hence is a tree. 


For an inessential subtree Tp of T, set Tp = G(E(T)\E(T)), and 
note that E(G) is the disjoint union of the edge set of the trees Tp and 
‘To. This implies that Tp and 7p have exactly one vertex in common. We 
call this vertex the root of To and denote it by root(7). Note that since 
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root(7) is adjacent to a vertex in To, as well as one in 7p, its degree is 
at least 2. Let us derive some immediate consequences of this definition. 


Proposition 3.5. If T, and T2 are inessential subtrees of an infinite 
tree T with root(T,) = root(T2), then Ty U T2 is also an inessential 
subtree of T. 


Proof. Because T, and T> have a common vertex, Tj, U T> is connected, 
and hence a subtree of T. Also Tj U Tz = T1/Q T4 is the intersection of 
two trees that have a common vertex root(71) = root(72), and is hence 
connected. 


Proposition 3.6. An infinite tree has an inessential subtree if and 
only if it has a leaf. 


Proof. Let v be a leaf of T and w be the unique vertex of T adjacent 
to v. Then the single edge vw is an inessential subtree of T. Conversely, 
if T’ is an inessential subtree of a tree T, then T”’ has at least two 
vertices of degree 1. Call them u’ and v’. We claim that at least one of 
wu’ and v’ has degree 1 in T. If, on the contrary, u’ and v’ are adjacent 
to vertices u and v in V(T)\V(T’), respectively, then the unique path 
from u to v will contain both u’,v’ € V(T’). Now consider the path 
joining u and v in G(£(T)\E(T’)). The unique path between wu’ and v’ 
in T must contain this path, and will hence depart T’. This contradicts 
the assumption that T’ is connected. 


Lemma 3.7. If O(T) contains an inessential subtree with k vertices, 
then T contains an inessential tree with at least k + 1 vertices. 


Proof. Let To be an inessential subtree of @(7) and v be one of its 
leaves. Because v is not a leaf of 7’, it must be connected to at least one 
vertex of V(T)\V(©(T)), which is automatically a leaf of T. The tree 
added by adding all such vertices w and the corresponding edges vw to 
To is connected and hence an inessential subtree of T with at least k+1 
vertices. 


Proposition 3.8. Let T be an infinite tree that can be trimmed 
indefinitely. Then T is amenable. 


Proof. We show that T contains an inessential subtree with arbitrarily 
large number of vertices. Since @*(T) contains a leaf, hence it contains 
an inessential subtree by Proposition 3.6. Now, a repeated application 
of Lemma 3.7 shows that T contains an inessential tree To with at least 
k vertices. Let S be the set of all vertices of To except for the root. It is 
clear that |S| = k — 1 and 0S = {root(7o)}. This shows that 


jas} 


(S| k=-V 


AMENABILITY OF TREES 185 


Hence, by letting k — oo, we obtain a sequence of Fglner sets in the 
tree. 


Lemma 3.9. Let T be an infinite tree such that ©(T) is amenable. 
Then T is amenable. 

1, 195| = 
Proof. Let S be a connected subgraph of O(T) with TS) <e€.Set S to 
be the connected subgraph of T obtained by adding all of the leaves of 
T that are connected to a vertex of S. Note that since the only vertices 
of T \ @(T) are leaves of T, the boundary of S in T’ is equal to the 


= as 
boundary of S in @(T). Since |S| > |S|, we have eel <e€. 


[5 


We can now prove the main result of this section which is the 
generalization of Theorem 2.7. 


Theorem 3.10. Let T be an infinite tree. Then T ts amenable if and 
only if T contains arbitrarily large inessential trees or for some k > 1, 
@*(T) has arbitrarily long paths without branch. Moreover, the former 
is always the case if T can be trimmed indefinitely. 


Proof. The same argument as in Proposition 3.8 shows that if T 
contains arbitrarily large inessential trees, then it is amenable. Hence, 
without loss of generality, we can assume that there exists k > 1 such 
that @*(T) does not have any leaves but contains arbitrarily long paths. 
The proof now follows by induction on k. For k = 0, we can take these 
long paths without branch as F¢lner sets. Since @*(T) = e*-l(@(T)), 
by induction hypothesis, we obtain that ©(T) is amenable. Hence, using 
Lemma 3.9, T is amenable. 

Let us now prove the converse. Assume that there exists k > 0 such 
that ©” (T) contains no leaves and does not contain arbitrarily long 
paths, and that the largest inessential tree of T’ has cardinality R. We 
will show that T is nonamenable. 

We will assume that @*(T) = O*+1(T) = T’, ie., T can only be 
trimmed k times and T’ does not have arbitrarily long paths. This 
implies that T is obtained from T” by adding a (possibly infinite) number 
of inessential trees, each attached at a distinct vertex of T. 

Let us now assume that A is a connected subgraph of T. 

Let A’ = AN V(T7"). We claim that 


1 
|A"| > Rial: 


Also the cardinality of the boundary of A’ in T’ is the same as the 
cardinality of the boundary of A in 7. This implies that 
|0.A’| |dA| 
Ag ees 
|A’| |A| 
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We have now reduced the problem to the case that the tree does not 
have any vertices of degree 1. 


4. Application: Amenability of Random Trees 


In this section, we will consider the question of amenability for Galton— 
Watson trees. First we will give some basic definitions regarding the 
Galton—Watson process. For details, the reader is referred to [AN72]. 

Let t = (pn)n>0 be a distribution on the set of non-negative 
integers. We will define the Galton—Watson process associated to m7 as a 
distribution on the set of rooted labeled trees. We will start by setting 
up the notations that will be used in this section. 

A Galton—Watson tree is always designated with a distinguished vertex 
refereed to as the root and denoted by @. Set 


(oe) 
T= {(9}U| JN’ 
j=l 
where N is the set of positive integers. For each I = (i1,...,7%) € Z, we 


define its length (or generation) by |I| = k. We will also set |@| = 0. A 
set J CT is called inductive if it satisfies the following properties: 


(1) BET. 

(2) For each k > 1 and I = (i,...,%%) € J, we have — 
(41,---,%-D ES. 

(3) For each k > 1 and I = (i,...,%%) € J, if t% > 2, then 
(41,---,%4%-1,4, -De JF. 


If J = 1, we say that J is an ancestor of J, and that J is a descendant 
of I. Intuitively, an inductive set is a set that is closed with respect 
to the ancestor operation and moreover, the set of offsprings of any 
vertex are always labeled from 1 to & for some k > 0. Let X7, I € I 
be a sequence of independent identically distributed random variables 
with distribution 7. A random rooted tree is constructed as follows: the 
root uy has Xg direct offsprings (also called children) denoted by wj,), 
for 1 < 4% < Xg. These vertices are called the first generation. From 
here, the construction continues inductively. Assume that the vertices of 
generation £ have been constructed. Each vertex in generation @ is of the 
form v; for some I = (1,...,%¢), hence || = €. The vertex vy has X7 
children, namely (41... ip,ig44), Where %+1 ranges from 1 to X7. It is easy 
to see that the set of vertices of the tree 7 thus constructed is of the 
form V(7) = {vy : J € J}, where J is an inductive set. We will denote 
the set of vertices in generation € by V(7)¢ and set We = |V(7)¢|. The 
rooted subtree of 7 consisting of all the offsprings of vertex vz rooted at 
vy will be denoted by T/. The (finite) rooted subtree of T consisting of 
all vertices of the first k generations will be denoted by 7x. 
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U1 v2 


U21 U22 


Remark 4.1. Before we proceed to the proof, a few remarks are in 
order. There is a vast literature on the Galton—Watson process. Let m = 
i[X7] = 0 jp; be the expected number of children of any vertex. It 
is a classical theorem that if p, #1, then 7 is almost surely an infinite 
tree iff m > 1. If py = 1, then T will be isomorphic to an infinite path 
and hence amenable. From now on we will always exclude this case. 
The cases m = 1 and m < 1 are usually referred to as the critical and 
subcritical case. Since every finite tree is by definition amenable, we can 
condition on the non-extinction of T. 


Proof of Theorem 2. First note that if pp = py = 0, then 7 is almost 
surely infinite and the degree of every vertex, except possibly for the 
root, is at least 3. Such a tree is clearly non-amenable. We will now show 
that if po > 0 or pi > 0, then the tree is amenable. Let m = )°?, kpp. 
By the above remark, we can assume that m > 1. First assume that 
m < oo. We know that E[W,,] = m”. We will distinguish two cases: 

Case 1: Assume that pp = 0 and 0 < py < 1. We will start by 
two observations. First, it is easy to see that in this case Wy is a non- 
decreasing sequence. Moreover, Wn+1 = Wry, if each vertex in generation 
n has exactly one offspring. This implies that if k > 1, 


[Wrst > WnlWn =k| =1—pk>1—pi >0. 


Hence P[Wp4i1 > Wp] = 1— pi, and an application of Borel—Cantelli 
shows that P| Wp, — oo] = 1. Second, with probability q = ot > 0 
each vertex in the the first d generations has exactly one child, i.e., the 
tree 7q41 is isomorphic to Pg4y. 

For any vertex vy in the nth generation, consider the rooted subtree T/ 
at vy. Note that of T/ are i.i.d. random trees with the same distribution 
as T. By the second observation above, each T/ has probability q of 
being isomorphic to a path of length d+ 1. Let Ap be the event that at 
least one of these subtrees is isomorphic to a path of length d+ 1. For a 
fixed r, we have 


P[An] = P[An| Wn > r]P[ Wn > 1] 
=(1-(1— 9)")P[Wn > r]>1-A-9)', 
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as n — oo. Given that r is arbitrary, we have P[A,| — 1, as n > oo. 
This means that with probability 1, 7 contains a path of length d+ 1 
for every d > 1, which proves the almost sure amenability of T. 

Case 2: Let us now consider the case that po > 0. Note that in this case 
there is no guarantee that Wn — oo as n > oo. Fix d > 1. We will show 
that, with probability 1, the isoperimetric constant of TJ is at most 1/d. 
The large Folner sets in this case arise from the following dichotomy: 
for an appropriate value of n >> 1: (a) either there are “many” vertices 
in generation n, in which case, with high probability, the subtree of 7 
starting from one of them must terminate exactly after d generations, 
i.e., the first d generations starting from one of these vertices must be 
a finite tree with all vertices of degree 1 in generation d, or, (b) there 
are “few” vertices in generation n, which implies that the first n — 1 
generation of the graph forms a large set with a small boundary. Let 
us make this idea precise. Fix d > 1, choose s > 0 such that ps > 0. 
Let Tq denote the (deterministic) finite rooted s-ary tree of depth d, 
that is, a rooted tree, where starting from root up to generation d — 1, 
each vertex has exactly s children, but the vertices in generation d have 
no any children. Let q be the probability that 7g41 is isomorphic T, q. 
Since po > 0, ps > 0, we have q > 0. 

Let F denote event that 7 is finite, or equivalently, that Wy = 0 for 
n >> 1. It suffices to show that 7 is amenable conditioned on E£°. Let Ag 
denote the event that 7 contains a subtree with isoperimetric constant 
at most 1/d. For any r > 1, choose n > rd, and note that given that T 
is infinite and W, > r, Aq will take place if at least one of the subtrees 
starting from one of the vertices in generation d is isomorphic to Ts q. 
Because there are at least r vertices in generation n, we have 


P[Agl|E°N{W, > r}}>1-A-@'. 


On the other hand, if there are at most r vertices in generation n, then 
the subtree 7,1, which has at least n vertices, has a boundary of size at 
most r, implying that its isoperimetric constant is at most r/n < 1/d. 
Hence, 


P[Agl|E°N{Wp < r}| =1. 
Combining the two cases, we have that for given r,d > 1, 
P[AglE’]>1-G-@)". 
Because r is arbitrary and q > 0, we deduce that for any d > 1, 
P[Ag|£‘°] = 1, implying that 7 almost surely has a set with isoperimet- 


ric constant at most 1/d, for every d, proving the almost sure amenability 
of 7 in this case. 
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Abstract 

We introduce a construction that gives rise to a variety of ‘geometric 
finite random graphs, and describe connections to the Poisson boundary, 
Naim’s kernel, and Sznitman’s random interlacements. 
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1. Introduction 


The purpose of this chapter is to introduce a new construction of 
‘geometric’ finite random graphs, called group-walk random graphs 
(GWRGs from now on), and describe the rich connections to other 
objects, including the Poisson boundary, Naim’s kernel, and Sznitman’s 
random interlacements. GWRGs do not only yield new interesting 
examples of random graphs, but, as we will argue, they can be thought 
of as a tool for studying groups. 
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Figure 7.1 A sample GWRG produced by computer simulation by A. Janse van 
Rensburg. The host graph is a ternary tree, and n = 5. Both drawings depict 
the same graph: the right-hand figure respects the natural cyclic ordering of the 
leaves of the host tree, while the left-hand one makes the component structure 
clearer. 


We start by introducing the simplest special case of GWRGs. Let G be 
an infinite homogeneous tree, rooted at a vertex o, called the host graph; 
we will later allow G to be an arbitrary locally finite Cayley graph, or 
even a more general graph. Let Gn := G[{v € V(G) | d(v,o) < n}] 
be the ball of radius n centered at o, and define the boundary 0G, to 
be the set {vu € V(G) | d(v, 0) = n} of vertices at distance exactly n 
from o. 

We construct a random graph R,, as follows. The vertex set of Ry 
is the deterministic set 0G,,. The edge set of Ry is constructed by the 
following process. We start an independent simple random walk in G), 
from each vertex uv € 0G,,, and stop it upon its first return to dGp, letting 
vt denote the vertex in 0G, where this random walk was stopped. We 
then put an edge in Ry joining v to v! for each v € dG. 

We could stop the construction here and declare Rp, to be our GWRG, 
but it is more interesting to consider the following evolution: let Re = 
Ry, and for i = 2,3,... let RE be the union of ae with an independent 
sample of Rh or in other words, Re is the random graph obtained as 
above when we start i independent particles at each vertex in dGy. 

An important observation from [15] is that these random graphs R?, 
have the following scale-invariance property. Let C, D be two branches 
of our tree G, where a branch is a component of G\e for some edge e. 
Then, for any fixed i, we have the following observation. 


Observation 1.1 ({15]). The expected number of edges of R*, from 
branch C' to branch D converges as n > oo. The limit is always > 0, 
and it is finite of and only if CA D=9. 
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This might at first sight look surprising, as the number of vertices of 
RE inside each of C’, D grows exponentially with n, yet no rescaling is 
involved in Observation 1.1. 

The same construction can be repeated when instead of the binary 
tree the host graph is any infinite graph. Then Observation 1.1 has a 
generalisation, but in order to formulate it we need the Poisson boundary 
P: instead of the ‘branches’ of Observation 1.1 we have to talk about 
subgraphs ‘converging’ to a measurable subset of P. This is explained in 
greater detail in Section 2, where we also elaborate more on the general 
construction of GWRGs and their variants. 

The construction of GWRGs was motivated by a measure space 
introduced in [15], called the effective conductance measure, which is 
closely related to the Poisson boundary. It is a generalisation of effective 
conductance for electrical networks, and it is important for the study of 
Dirichlet harmonic functions on infinite graphs. More details about this 
measure are given in Section 3. 

I expect that GWRGs can unify many existing models of geometric 
random graphs, while introducing new ones, and offer new tools for 
analysing them including the Poisson boundary, as well as the notion 
of graphons [5]. Conversely, GWRGs provide an additional tool for indi- 
rectly studying groups, just like random walks; see Section 8.3 for more. 

The study of random graphs is currently one of the most active 
branches of graph theory. By far the most studied random graph model 
is that of Erdés & Rényi (ER) [10], in which every pair of vertices is 
joined with an edge with the same probability, and independently of 
each other pair. 

In recent years, many models of geometric random graphs have been 
emerging [16, 22]. The idea now is to embed the set of vertices (possibly 
randomly) into a geometric space—usually the euclidean or hyperbolic 
plane, and their higher-dimensional analogs—and then to independently 
join each pair of vertices with a probability that decays as the distance 
between the vertices in the underlying space grows. 

One advantage of these geometric random graphs compared to the 
ER model is that they can approximate real-life networks much more 
realistically, but they are also of great theoretical interest given the 
impact of the ER model. A disadvantage is that there is an infinity 
of such models, obtained by varying the underlying geometry, the way 
the points are embedded, and the connection probability as a function 
of distance, and no canonical choice is available. 

I like thinking of GWRGs as geometric random graphs, where the 
underlying geometry is a Cayley graph G of an arbitrary finitely 
generated group.! Although there is a huge variety for the underlying 


1 The Cayley graph of a group I with respect to some symmentric generating set S 
is the graph with vertex set T in which zy is an edge whenever y = zs holds for 
some sé 8S. 
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‘geometry’, there are tools for their analysis that apply to all cases and 
will be discussed here. 

This chapter is written in survey style although the material reviewed 
is quite new and partly under development, the main aim being to make 
the open problems of the project accessible to other researchers willing 
to get involved. A lot of the material is drawn from the paper [15] 
which is still in progress. New here is the definition of GWRGs and 
some observations about them. 


2. The General Construction of GWRGs 


In the Introduction we chose the host graph G to be a tree, the reason 
being that Observation 1.1 is easier to state in that case. Let us now 
consider the general case where the host graph G is arbitrary, and see 
how Observation 1.1 generalises, which will lead us to the definition of 
the effective conductance measure. 

The construction of Ri can be repeated verbatim, except that the 
number of particles we start at a vertex v € 0G» in round 27 is equal to 
the vertex degree dg, (v) of v in the ball G,; the reason will become 
apparent later. However, we could instead of starting exactly dg, (v) 
particles at v in round 7, start a random number of particles following 
some distribution with expectation dg, (v), the most natural candidate 
being the Poisson distribution; the following discussion remains valid for 
this variant of GWRGs. 

Observation 1.1, which is easy to prove in the case of trees, now 
becomes a substantial theorem, but in order to formulate it we need 
to involve the concept of the Poisson boundary of G to extend the above 
notion of branch in the correct way. 

The Poisson boundary of an infinite (transient) graph G is a mea- 
surable, Lebesgue—Rohlin, space P = P(G), endowed with a family 
of probability measures {u, | v € V(G)}, such that every bounded 
harmonic function h : V(G) > R can be represented by integration 
on P: we have h(v) = Ip hdpy() for a suitable boundary function 
h:P > R. This can be thought of as a discrete version of Poisson’s 
integral representation formula: 


h(z) ih 1— lal? jo) do ie i(0)dv.(0) 

Z= =. — vz(0), 
0 |e2ri0 _ 7/2 0 a 

recovering every continuous harmonic function h : D — R in terms of its 

boundary values h : S! > R, except that we replaced D with a transient 

graph. 

Triggered by the work of Furstenberg [12] who introduced the concept, 
the study of the Poisson boundary of Cayley graphs has grown into a 
very active research field; see [11] for a survey including many references. 
Although there is a straightforward abstract construction of the Poisson 
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boundary ?(G) of any Cayley graph, given a concrete G it is desirable 
to identify P(G) with a geometric boundary. This pursuit can, however, 
be very hard, although some general criteria are available [17, 18]. 

As an example, we remark that the Poisson boundary of a regular 
tree can be identified with its set of ends, and the Poisson boundary 
of a regular tessellation of the hyperbolic plane can be identified with 
its circle at infinity. More generally, the Poisson boundary of any non- 
amenable, bounded-degree, Gromov-hyperbolic graph coincides with its 
hyperbolic boundary [1, 2]. The Poisson boundary of a 1-ended bounded- 
degree planar graph can be identified with a circle [13]. 

Now back to Observation 1.1, letting G be an arbitrary transient 
graph, we consider measurable subsets X, Y of P(G). One can asso- 
ciate with these sets sequences of vertex sets (Xn)nen,(Yn)nen, where 
Xn, Yn C IGy in the above notation, such that for random walk on 
G from any starting vertex, the events of converging to X and visiting 
infinitely many of the X, coincide up to a set of measure zero, and 
similarly for Y and the Y,,. The first statement of Observation 1.1 
generalises as shown in the following theorem. 


Theorem 2.1 ({15]). The expected number of edges of R', from Xp, to 
Yn converges as n — OO. 


For the second statement of Observation 1.1 we remark that the limit 
is infinite when X 1 Y has positive measure, and there are, rather rare, 
interesting cases where the limit is infinite independently of the choice 
of X, Y as long as they have positive measure: any lamplighter graph 
over a transient graph has this property [15]. 

In order to understand why Theorem 2.1 (or Observation 1.1) is true, 
it is helpful to consider the well-known relationship between random 
walks and electrical networks as introduce by Doyle and Snell [9]. Think 
of Gp, as an electrical network with boundary nodes 0G;,,, at which we 
impose a constant potential v(b) = 1 for every b € 0G,,. This Dirichlet 
problem has the unique trivial solution v(#) = 1 for every « € V(Gp). 
Now, the aforementioned relationship tells us that the solution to any 
such Dirichlet problem can be obtained as follows: we start dg, (b)u(b) 
random walk particles at each boundary node b, and stop them upon 
their first re-visit to 0G,. Then letting u(x) be the expected number 
of visits to x by all those particles divided by the degree d(x) solves 
the Dirichlet problem [14]. But as our Dirichlet problem has constant 
boundary values, we then expect d(x) visits to each interior vertex x. 
This implies that if we start dg,(b) random walkers at each vertex 
b € dGp, stop them upon their first re-visit to 0G;,, and observe the parts 
of their trajectories inside Gy, for m < n, then the situation we observe 
will be similar as if we had performed the same process on Gm instead 
of G,. This is the central observation for the proof of Theorem 2.1. 

Theorem 2.1 will be crucial in the next section. 
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3. The Effective Conductance Measure C 


Theorem 2.1 relates our GWRGs to the Poisson boundary, but in fact 
the connection is more intricate. Before elaborating on this, we recall 
Douglas’ [8] formula 


Qn Qn “: 
E(h) = i} Pla) — HOPOC.E)dndE, 


expressing the (Dirichlet) energy of a harmonic function h on the unit 
disc D in the complex plain from its boundary values h on the circle 
S! = aD, where O(n, ¢) := 16x sin? ns . The physical intuition here 
is that E(h) is the power dissipated by: a circular metal plate, on the 
boundary circle of which some potential A:S! > Ris imposed by an 
external source or field. 

A discrete variant of this formula is the following, expressing the 
energy dissipated by a finite electrical network with a set B of ‘terminal 
nodes’, i.e. vertices that are connected to external power sources, in terms 
of the voltages at B and certain ‘effective conductances’ relative to B 
(see [15] for details). 


E(h) = Da pep (h(a) — h(b))? Cas. (7.1) 


In [15] we prove the following statement, providing a general formula 
for the Dirichlet energy of harmonic functions on a graph, which is similar 
to Douglas’ formula. 


Theorem 3.1 ({15]). For every transient graph G, there is a measure 
C on P(G)? such that for every harmonic function h: V(G) > R with 
boundary function h, the energy E(h) equals 


A ‘A A 2 
p¢iy =f  (fum - vey) een. 
P(G)? 


This measure C, which we call the effective conductance measure, can 
be thought of as a continuous analogue of effective conductance in a 
finite electrical network due to the similarity of the above formula with 
(7.1); in the next section we will elaborate more on this. 

The construction of C is based on Theorem 2.1: we set the value 
C(X, Y) to be the limit value returned by that theorem, and apply 
Caratheodory’s extension theorem to show that this defines a measure 
on P(G)?. This explains the relationship between GWRGs and C, and 
also my motivation in introducing the former. 


4. Doob’s Formula and Naim’s Kernel 


Theorem 3.1 was motivated by a similar result of Doob [7], which 
generalises Douglas’ formula to arbitrary Green spaces. I will refrain 
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from repeating the definition of Green spaces here, which can be found 
in [7], and suffice it to say that they generalise Riemannian manifolds. In 
a sense, our Theorem 3.1 is a discrete version of Doob’s result. But rather 
than working with the Poisson boundary —which first appeared the year 
after Doob’s paper [12]|—Doob worked with the Martin boundary, which 
is closely related to the Poisson boundary (the latter can be defined as the 
support of the former, but the former is also endowed with a topology). 
Moreover, Doob’s approach to the effective conductance measure was 
different from ours: rather than working with the measure, Doob was 
working with its density, namely the Naim kernel. Let me explain this 
further, as it will be of interest later. 
Naim [20] introduced the formula 


G(a, y) 
G(a, 0)G(o, y)’ 


where x,y are points of a Green space X, and G(.,-) denotes the 
corresponding Green’s function (see [6] for definitions). The same 
formula, however, makes perfect sense when z, y are points of a transient 
graph G, in which case G(z, y) denotes the discrete Green’s function, i.e. 
the expected number of visits to y by random walk from z. 

Naim proved that ©@(z, y) can be extended to pairs of points {y,¢} in 
the Martin boundary of X, by taking a limit 


O(n, o) = lime +n,y>eO(@, y). (7.2) 


The convergence of this limit is only clear when one of x,y is fixed 
and the other converges to a point 7 in the boundary, as it reduces 
to the well-known convergence of the Martin kernel K(x, y) := ae 
[26]. The two-sided convergence is a puzzling fact, and in fact is not 
true everywhere but ‘almost everywhere’: the exact statement proved by 
Naim [20] is too technical to state here precisely. It involves Cartan’s 
fine topology. As far as I know, this convergence has not been proved for 
graphs; we will return to this issue below. 

Our intuitive interpretation of ©(7,¢) is that it denotes the ‘effective 
conductance density’ between the boundary points n,¢.'To support this 
intuition, we remark in [15] that if 7, y are boundary vertices of a finite 
electrical network, then ©(z, y) does indeed coincide with the effective 
conductance between x and y if the Green function G is defined with 
respect to random walk killed at the boundary. 

Doob’s formula for the Dirichlet energy of a harmonic function h on 
X, with boundary extension A on the Martin boundary M(X) of X (see 
e.g. [26] for the definition) reads 


O(2,y) = 


im A A 2 
pity = ff (fy =) Ola, 2yduolM Aol. 
M(x)? 


Notice the similarity to the formula of Theorem 3.1 and Douglas’ (7.1). 
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As we show in [15], our effective conductance measure C is absolutely 
continuous with respect to the square of harmonic measure [49 on P, and 
so we can define a kernel ©’ on P? by the Radon-Nykodym derivative 
Au ¢). The above discussion suggests that ©’ should coincide with © 


if © can be defined by a limit similar to (7.2). Now instead of trying 
to imitate Naim’s proof of the convergence of (7.2), which is rather 
technical, we propose the following problem. 


Problem 4.1. Let G be a transient graph and o € V(G). Let 
(Zn)nen and (yn)nen be independent simple random walks from o. Then 
limn,m—+oco O(%n, Ym) exists almost surely. 


This is seemingly easier than trying to prove the convergence of (7.2), 
since, for example, it is easier to prove the Martingale convergence 
theorems than Fatou’s theorem. If it is true, then one could further ask 
the following. 


Problem 4.2. Let G be a transient graph and o € V(G). Let (tn) nen 
and (Yn)nen be independent simple random walks from o, and X, Y mea- 
surable subsets of P(G). Then C(X, Y) = Elimp moo May (X My, (Y) 
O(zn,Y¥m), where C denotes the effective conductance measure. 


The factors fz, (X), My, (Y) in the above limit essentially ‘condition’ 
the random walks to converge to X, Y respectively. This last problem is 
motivated by our belief that 0’ = ©. 


5. Random Interlacements 


Recall that we have defined C(X, Y) as the limit of the expected number 
of edges of our GWRG R! from X,, to Yn. In doing so, we were only 
interested in the starting and finishing vertices of our random walks, 
ignoring the exact trajectories inside the host graph G. In this section we 
remark that the distributions of these trajectories converge, in a certain 
sense, to the intensity measure of the random interlacement model as 
introduced by Sznitman [23] for G = Z% and generalised to arbitrary 
transient G by Teixera [24]. 

Given a transient graph G, the random interlacement on G is a Poisson 
point process on the space of two-way infinite trajectories in G modulo 
the time shift. It is governed by a o-finite measure v on (W*, W*), where 
W* denotes the set of equivalence classes of two-way infinite walks in 
G with respect to the time shift, and W* is the canonical sigma-algebra 
on W*. We remark that this measure v can also be obtained from the 
process we used to construct our GWRG R} as follows (the original 
definition of Sznitman will be given below). 

Let C be a cylinder set of W* defined by a finite walk Z, ie. C is 
the set of two-way infinite walks containing Z as a subsequence. Let Pr, 
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be the random process from the definition of Re that is, a collection 
of particles performing random walk starting at dG, and stopped upon 
the first revisit to 0G,, with dg, (x) particles started at each v € dGy. 
We set 


w(C) := lim, E/#{trajectories in P, containing Z as a subwalk}], 
(7.3) 


and extend « to a measure on (W*,W*) using e.g. Caratheodory’s 
extension theorem. It turns out [15] that 


wh=v. 


An interesting consequence of this is the following relation between the 
effective conductance measure C and the random interlacement intensity 
measure, v: 


Corollary 5.1. For every two measurable subsets X,Y of P(G), we 
have 


C(X, Y) = v(Wyy), 


where Wy denotes the set of elements of W* all initial subwalks of 
which meet infinitely many Xy (as defined in Section 2) and all final 
subwalks of which meet infinitely many Yn. 


In order to explain why this is true we need to recall the definition of 
the intensity measure v from [24]. 

To begin with, given a finite subset K of V(G), we define the 
equilibrium measure ex on K by 


eK (£4) = IgeK CePz[A | random walk never returns tok], 


where cy is the vertex degree of x. 

Let 2* be the canonical projection from the set of two-way infinite 
walks W to W*. Then v is defined as the unique measure on (W*, W*) 
satisfying, for every finite subset K of V(G), 


lwy -v=7*0 Qk. (7.4) 


(Another way to state this formula is v(1 We -A) = Qx(a*—!(A)), where 


x*—1(A) returns those walks in the z*-preimage of A that enter K at 
time 0 for the first time.) 

Here, Qx is a finite measure on the space Wx of two-way infinite 
walks meeting K, given by the formula 


QK|(X-n)nz0 € A, X0 = 2, (Xn)nz0 € BI 
= P,[A | no return to K]ex (x)P,[B], 


where A, B are measurable subsets of the space W of one-way infinite 
walks. Let me explain why v coincides with yz as claimed above. The main 
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idea is to think of the equilibrium measure ex (xz), which is proportional 
to the probability of escaping K, as the probability for a random walker 
coming from ‘infinity’ to enter K at x; this intuition is justified by the 
reversibility of our walks, and coming from infinity can be made precise 
using the process used to construct GWRGs (and j1). 

To make this more precise, let G,, denote the ball of radius n around 
o, and suppose that K C V(G,) for some K C V(G). Let GF be the 
graph obtained from G by contracting the complement of Gp, into a single 
vertex *7,. Then the reversibility of our random walk implies that, for 
every x € K, letting Py denote the law of a random walk Xo, X1,...X7 
from y stopped the first time t > 0 when X; € K U {xn}, we have 


CeP [Xr = *n| = Cin Px, [Xr — a]. 


Now note that, by the transience of G, the limit as n goes to infinity of 
the left-hand side converges to ex (x), while the limit of the right-hand 
side is closely related to our definition of jw. 

Random interlacements have been studied extensively, and researchers 
have found interesting applications in the study of the vacant set for 
random walk on discrete tori [19]. This connection to the Poisson 
boundary has apparently not been observed before. 


6. Graphons 


Graphons were recently introduced [5] as a notion of limit for sequences 
of dense finite graphs, and they have already had a seminal impact on 
combinatorics. Formally, a graphon is a symmetric, measurable function 
w : [0,1]? > [0,1]. 

Every graphon naturally gives rise to a family of random graphs G), 
on n vertices for every n € N: we sample n independent, uniformly 
distributed points from [0,1] to be the vertices of G,, and join vertices 
x,y with probability w(a, y). 

Graphs sampled this way are dense, i.e. have average degree of order n. 
But a variant of the above sampling method introduced in [4] produces 
graphs of average degree o(n). 

Now note that formally, our effective conductance measure C is similar 
to a graphon, as it is a measure on a square. In particular, we can sample 
a random graph from it. We expect these graphs to be closely related to 
our GWRGs. 


Problem 6.1. Show that for every (transient) Cayley graph G, the 
sequence of corresponding GWRGs converges to a sparse graphon in the 


sense of [4]. 


AGELOS GEORGAKOPOULOS 200 


7. Simulation Data 


Computer simulations on GWRG for certain concrete host graphs G, 
performed by Chris Midgley [19], suggest that certain properties are 
heavily influenced by G, while convergence with n is fast enough that 
simulation data can help make explicit predictions. 

In most simulations the host graph was the 2- or 3-dimensional grid Z? 
or Z}, the infinite binary tree T2, a hyperbolic planar graph T° obtained 
from the binary tree by adding a cycle C,, joining the vertices at distance 
n from the root for every n, and the lamplighter graph L over Z. 

The outcomes of these simulations, with 10,000 random graphs Ri 
generated in each case for n up to 8 (where the exponential growth of 
T° and L become computationally demanding), suggest the following: 


e The number of isolated vertices of Bi is asymptotically proportional 
to the number of vertices |dG,,| of Rt. The same is true for the 
number of components of Ri. The corresponding leading coefficients 
are very similar (possibly converging to the same number) when G 
is Z? or T°, slightly different when G = L, and very different when 
G=2Z3. 

e The expected diameter of the largest connected component of te 
seems to be proportional to log(|dG |). 


In a further experiment of [19] on the infinite binary tree Tz, random 
graphs Ri were generated for 2 < n < 8 for all values of 7 until the 
first time that Rt becomes connected. The data suggest (quite clearly) 
that the average time 7 till connectedness is roughly 0.26n, i.e. linear 
in n. (The exact outcome was i = (0.26 + 0.003)& + (0.90 + 0.02) with 
an adjusted R? value of 0.9993.) 


8. Further Problems 


8.1. Dirichlet Harmonic Functions 


Let G be a graph on which all harmonic functions with finite Dirichlet 
energy are constant; the class of such graphs is denoted by Oyp. By 
Theorem 3.1, we can introduce the following trichotomy for such graphs: 


(i) P(G) is trivial (in other words, G has the Liouville property), or 
(ii) P(G) is not trivial, and C(X, Y) = oo for every two measurable 
X,Y C P(G), or 
(iii) none of the above holds, but the integral of Theorem 3.1 is infinite 
for every boundary function h. 


Recall that the property of being in Oyp is quasi-isometry invariant 
for graphs of bounded degree [25], and therefore, given a group I, it 
is independent of the choice of the Cayley graph of IT. A well-known 
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open problem asks whether the Liouville property too is independent of 
the choice of the Cayley graph of I. The above trichotomy suggests the 
following refinement of that problem for groups in Oyp. 


Problem 8.1. Let G be a Cayley graph in Oyp. Do all finitely 
generated (in other words, locally finite) Cayley graphs of the group of 
G have the same type with respect to the above trichotomy? 


We remark that Cayley graphs of all three types exist: Z? is of type 
(i), lamplighter graphs over any transient Cayley graph are of type (ii) 
[15], while all tessellations of hyperbolic 3-space H? are of type (iii). 


8.2. GWRGs 


None of the results suggested by the simulations of Section 7 have been 
proved rigorously, and it would be interesting to do so, especially if the 
methods involved can be applied to large families of host graphs. Results 
on the expected number of components or isolated vertices of AR could 
be within reach. 

The interesting meta-problem is to find natural properties of the 
GWRGs that are universal in the sense that they are true independently 
of the host graph G (under some restriction, e.g. G being vertex- 
transitive), as well as properties that do depend on G and the dependence 
can be explained. For example, what can be said about R,, if the host 
G is hyperbolic? 

Erdos Rényi random graphs G(n,p) are known to display a sharp 
threshold for their connectedness at the value p = In n/n for the presence 
of each edge. This threshold coincides with the threshold for having 
an isolated vertex [10]. This motivates the question of whether similar 
behaviour is observed for GWRGs. To make this more precise, define 
the random variable t, := minf{i | Gi is connected}. Our first question 
is how concentrated t,, is (we interpret high concentration as a sharp 
phase transition). The second question is whether the threshold for 
connectedness coincides with that for absence of isolated vertices: letting 


Tt, := min{i | G} has no isolated vertices}, 


Problem 8.2. Js lim, £2 =1# 


Et* 


This might depend on the host graph G, and the answer might be 
positive for every ‘nice’ host, e.g. any Cayley graph. 

For any host graph G, and n é€ N, consider the ball Gn as a 
random rooted graph G* by rooting it at a uniformly chosen vertex 
in the boundary dG,. Then, as suggested by Gourab Ray (private 
communication), if G* converges in the local weak sense, then our 


2 This question was answered in the negative when the host graph is the binary tree 
by J. Haslegrave (private communication) after this chapter was accepted. 
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GWRG R? should converge for every i in the Benjamini-Schramm sense 
[3]. The relationship between this limit and the host graph could be 
interesting. (But even if G* does not converge, it will have sub-sequential 
limits and the situation is not less interesting.) 

A particular case of interest is where G is the grid Z*, and we start 
particles at dG, according to the Poisson distribution in the construction 
of Ri. The aforementioned limit coincides then with the long-range 
percolation model as defined e.g. in [21] (this connection was again 
noticed by Gourab Ray). 

It would be interesting to compare our GWRGs induced by certain 
simple Cayley graphs, like tessellations of the hyperbolic plane, to other 
geometric random graphs from the literature: 


Problem 8.3. Show that other geometric random graph constructions 
(e.g. those appearing in [22]) can be obtained as special cases of (or 
approximated by) GWRGs by choosing an appropriate underlying graph. 


8.3. Groups 


Our main objective is to understand the interplay between typical 
properties of GWRGs and their host groups. In particular, we have: 


Problem 8.4. Which properties of the random graphs are determined 
by the group of the host graph and do not depend on the choice of a 
generating set? 


A further problem in this vein is the following. 


Problem 8.5. Are the properties of transience, Liouvilleness, exis- 
tence of harmonic Dirichlet functions, and amenability on the host 
Cayley graph detectable by the asymptotic behaviour of the corresponding 
GWRGs? 


Answers to these problems would make GWRG a tool for studying 
group-theoretical questions, on a par with the study of random walks on 
groups. 


9. Conclusions 


We introduced a construction of ‘geometric’ random graphs (GWRGs), 
and established strong links to the Poisson boundary of their host graphs, 
the effective conductance measure and Naim’s kernel, and to random 
interlacements. This project will be successful if we can exploit these 
connections in order to make conclusions about one of these objects by 
studying the other. Another major aim is to relate GWRGs to existing 
models of geometric random graphs. Last but not least, we would like 
to understand the effect of various properties of the host group on the 
typical graph-theoretic properties of GWRGs. 
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Abstract 
We prove that the trace of a transient branching random walk on a 
planar hyperbolic Cayley graph has a.s. continuum many ends and no 
isolated end. 
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1. Introduction 


A branching random walk (BRW) is a growing cloud of particles on 
some graph G. In this note we consider BRWs in discrete time, which 
are defined as follows. The process starts with one particle in the root 
o of the graph G. At each time step each particle splits into offspring 
particles, which then move one step according to a random walk on G. 
Particles branch and move independently of the other particles and 
the history of the process. A BRW is therefore driven by two classical 
stochastic processes: Galton—Watson processes and random walks. Under 
the assumption that the underlying Galton—Watson process survives the 
number of particles grows exponentially. If the return probabilities of 
the underlying random walk decay subexponentially the effect of the 
growing particles overshadows the transience of the spatial dynamic, and 
the underlying graph will eventually be full of particles. However, if the 
return probabilities of the random walk decay exponentially as well, then 
there is a critical growth rate of the Galton-Watson process where the 
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two exponential effects cancel out. Above the critical value the BRW 
is again recurrent, i.e. every finite set is visited infinitely many times 
with positive probability; below and at the critical value every finite set 
is eventually free of particles and the BRW is called transient. In the 
transient case, the set of visited vertices and traversed edges defines a 
proper random subgraph of G and its properties become of interest. This 
subgraph is called the trace Tr of the BRW. 

Let « = (%)e>0 be the probability distribution that describes the 
branching, i.e. each particle produces k offspring with probability wz. 
The expected number of offspring is denoted by m = )°).9 kug. In this 
note we assume the underlying graph G = G(I',$) to be the Cayley 
graph of a finitely generated group I’ with respect to a finite symmetric 
generating set S. The movement of the particles is governed by a driving 
measure gq on S'U{e}, where e is the group identity. The driving measure 
q defines a transition kernel P = (P@.))e yer by p(a,y) = q(x~ty) 
for all «,y € T. The corresponding n-step transition probabilities are 
denoted by p(x, y). 

We make the following standing assumptions. 


Assumption 1.1. 


e The underlying Galton—Watson process is supercritical, that is, 
m> 1. Furthermore, we assume that uo = 0 and 1 > 0. 

e The driving measure q of the random walk on G is symmetric, 
i.e. g(s) = q(s—!) for all s € S, and satisfies supp(q) = SU {e}. 


These assumptions are to some extent chosen to improve the presen- 
tation. The assumptions that are really necessary are that m > 1 and 
that the driving measure q is symmetric. 

The spectral radius, 9 = p(P) = limsupy_..5(p™(e, e))/”, is a 
crucial quantity in the study of BRWs: a BRW on a Cayley graph is 
transient if and only if mp < 1. This is a consequence of the classification 
of recurrent groups and Kesten’s amenability criterion, see also [8] for 
an alternative proof. We speak of a critical BRW if mp = 1. 

It was shown in [3] that the trace of a transient BRW on a Cayley 
graph G is a.s. transient for simple random walk but recurrent for BRW. 
Therefore, the trace of a transient BRW is a.s. a proper subgraph of G. It 
is believed that the trace shares many properties with infinite percolation 
clusters in the non-unicity phase. In the case of BRWs on free groups 
(or regular trees) it even turns out that the law of the trace of a BRW 
is the law of an infinite cluster of some invariant percolation, see [2, 3]. 
However, the situation is not as clear for other Cayley graphs, especially 
one-ended Cayley graphs. 

This note is devoted to the following property of invariant percolation 
and its analogue for the trace of BRWs. A bond percolation is a probabil- 
ity measure on subsets (also called configurations) w of the edge set E of 
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the graph G. If the distribution of @ is invariant under the left action of 
I, we call the percolation (I-)invariant. A percolation is called insertion- 
tolerant if AU {e} has positive probability for alle € EF and all edge sets A 
having positive probability. For every invariant insertion-tolerant perco- 
lation process on a non-amenable Cayley graph G that has a.s. infinitely 
many clusters, we have that a.s. every infinite cluster has continuum 
many ends, no isolated end, and is transient for simple random walk, see 
Theorem 8.32 in [12] where this is stated in more generality. 

On groups with infinitely many ends, there are various ways to see 
that the trace has infinitely many ends. In some cases even the Hausdorff 
dimension of the set of ends can be calculated; see [11] for free groups 
and [6] for free products of groups. 

As pointed out in [7] there is an elegant argument using symmetry that 
the trace of a subcritical BRW, i.e. m < 1/p, has infinitely many ends. 
This argument extends to the critical case if 7,5, nm™p™(e, e) < ov. 
However, this condition does not apply to the most interesting graphs 
like Cayley graphs of hyperbolic groups and the following conjecture 
remains open. 


Conjecture 1.2 (I. Benjamini). Let G be any non-amenable vertex 
transitive graph and assume the transition kernel P of the underlying 
random walk to be symmetric. Then the trace of a transient BRW has 
infinitely many ends. 


Remark 1.3. The assumption of symmetry is crucial, as there are non- 
symmetric driving measures that induce one-ended traces, see [7]. 


We answer this conjecture affirmative for BRWs on planar hyperbolic 
Cayley graphs. We briefly recall the definition of a Gromov hyperbolic 
group. A geodesic triangle consists of three elements u, v, w € I’ together 
with three geodesic paths connecting each pair of these vertices. Let 
5 > 0. A geodesic triangle is called 6-thin if each side is contained in a 
6-neigbourhood of the two other sides. A graph is called hyperbolic if it 
satisfies the thin triangle condition for some 6. A hyperbolic group is a 
finitely generated group whose Cayley graphs are hyperbolic. 


Theorem 1.4. Under Assumption 1.1 the trace of a transient BRW 
on a planar hyperbolic Cayley graph has a.s. continuum many ends and 
no isolated end. 


The proof uses the concept of unimodular random graphs. This class of 
graphs can be considered as stochastic generalizations of Cayley graphs 
and are also known as stochastic homogeneous graphs. We will use the 
fact that the trace of a BRW is a unimodular random graph and therefore 
gives rise to the application of the generalized Mass-Transport principle. 
This property is used to prove that a.s. every trace that has at least 
3 ends has no isolated end, see Proposition 3.1. The main issue will 
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be to prove that the trace has a.s. no isolated end, see Proposition 
3.2, and hence infinitely many ends. The proof of this proposition uses 
crucially the planarity of the Cayley graph, a recent result in [9] that 
Ancona’s Inequality still holds true for the Green function at the radius 
of convergence, and Proposition 3.1. 

In Section 2 we give some background on unimodular random graphs 
(URG) and random walks on hyperbolic groups. We believe that the 
information given there is sufficient to follow the proof of Theorem 1.4 
in Section 3. However, for readers who are not familiar with the concept 
of URGs and with random walks on hyperbolic groups, it might be useful 
to consult some of the references given along Section 2. 


2. Preparations 


2.1. Definition and Preliminaries 


We use the standard notation for a locally finite graph G = (V, FE): 
V is the set of vertices, F is the set of edges, and we write z ~ y if 
(z, y) € E. The distance between two vertices is the length of a shortest 
path between these vertices and will be denoted by d(-.,-). We write 
(G, 0) for a rooted graph with root o. 

Let I. be a finitely generated group with group identity e; group 
operations are written multiplicatively. The group I together with some 
finite symmetric generating set S induces a Cayley graph G = G(T, S) 
whose vertex set equals [ and « ~ y if and only if 2~!y € S. The group 
identity e of I will be identified with the root o of the Cayley graph G. 

Let q be a probability measure on the generating set S U {e}. The 
corresponding random walk (Sn)ns90 on G is a Markov chain with 
transition probabilities p(x, y) = q(a~'y) for x,y € T. Equivalently, the 
random walk (starting in x) can be described as Sp, = #X1---Xn, n> 1, 
where the X;’s are i.i.d. random variables with distribution q. 

Besides the definition of BRWs given in Section 1 there is another 
powerful description of BRWs. This definition is based on the concept 
of tree-indexed random walks introduced in [4]. Let (T,r) be a rooted 
infinite tree. The tree-indexed random walk can be described as a 
marking (or labelling) of the rooted tree (T,r). For any vertex v Ar 
denote by v~ the neighbour of v closest to r. Label the edges of T with 
iid. random variables Xy’s with distribution gq; the random variable X, 
is the label of the edge (v~,v). These labels correspond to the steps 
of the tree-indexed walk and the positions of ‘particles’ are given by 
Sy = @- JT] Xv, where (uo =r, v1,..., Un = v) is the unique geodesic 
from r to v at level n. 

A tree-indexed random walk becomes a BRW if the underlying tree T 
is a realization of a Galton—Watson process. We call T the family tree of 
the BRW. 
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2.2. Unimodular Random Graphs 


In this note we only give the essentials needed for our proofs. We invite 
the reader to consult [3, 13] for more details on the connection between 
BRWs and unimodular random graphs, and [1] for a more general 
introduction to the concept of URGs. 

A rooted isomorphism between two rooted graphs (G, 0) and (G", 0’) 
is an isomorphism of G onto G’ which maps o to o’. We denote by 
G,, the space of isomorphism classes of rooted graphs and write [G, o] 
for the equivalence class that contains (G,o). In the same way one 
defines the space G,.. of isomorphism classes of graphs with an ordered 
pair of distinguished vertices. That is, (G1, 01,02) and (G2, 0}, 05) are 
isomorphic if and only if there is an isomorphism from G1 onto G2, which 
maps 01 to 0; and 02 to 05. The spaces G, and G,,, can be equipped with 
metrics that turn them into separable and complete metric spaces. 

A Borel probability measure v on G, is called unimodular if it obeys 
the mass-transport principle (MTP): for every Borel function of the form 
f : Gux — [0,00], we have 


[XO AGo.2 a6, 0) = f Y> f(G, 2, 0)dv([G, o]). (8.1) 


reV zeV 


Realizations of unimodular measures are called unimodular random 
graphs. 

An important class of unimodular measures arises from Galton— 
Watson processes. The Galton—Watson tree is defined inductively: start 
with one vertex, the root r of the tree. Then the number of offspring of 
each particle (vertex) is distributed according to w. Edges are between 
vertices and their offspring. We denote by GW the corresponding 
measure on the space of rooted trees. In this construction the root clearly 
plays a special role and GW is not unimodular. However, if we bias the 
distribution such that the probability that the root has degree k + 1 is 
proportional to ane we obtain a unimodular measure UGW. When we 
use the UGW measure instead of the standard GW measure to define 
the family tree of the BRW, we denote the BRW by UBRW. 

Due to the description of the BRW as a tree-indexed random walk the 
unimodularity of UGW carries over to the trace: the trace of a UBRW 
on a Cayley graph is a unimodular random graph; see Theorem 3.7 
in [3]. This property makes the UBRW more natural to consider than 
the original BRW and we will prove Theorem 1.4 for UBRWs. However, 
it is not difficult to see that it then also holds true for BRWs. 


2.3. Ends of Graphs 


Consider a locally finite graph G = (V,E). A ray is a sequence 7 = 
(x9, %1,..-) of distinct vertices such that 2; ~ 2j+1 for all i > 0. For 
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any finite set F of vertices we consider its complement G\F, which is 
the graph induced by the vertex set V\F. This graph consists of finitely 
many connected components. Every ray a must have all but finitely 
many points in exactly one component; we say that a ends up in that 
component. Two ends are equivalent if they end up in the same connected 
component for all choices of F. Ends are equivalence classes of rays and 
we denote by ?G the set of ends. Let F be a finite vertex set and C’ 
be some component of G\F. We write 0C' for the set of ends whose 
rays end up in C’. The space of ends 3G can be equipped with a discrete 
topology in the following way. For any finite set F and any end w there is 
precisely one component of G\F whose completion contains w. Varying 
F yields a neighbourhood base for w. An isolated end is an end that is 
isolated in this topology. 


2.4. Hyperbolic Groups and Random Walks 


From now on we assume that the underlying group I is hyperbolic, and 
the generating set S induces a planar Cayley graph G = G(T, S$). Both 
assumptions, hyperbolicity and planarity, are crucial. 

Let us first collect several classic facts about hyperbolic groups and 
random walks; we refer to the survey [5] for an excellent introduction. 
An elementary hyperbolic group is either finite or has two ends. We 
will focus on the case of non-elementary hyperbolic groups since random 
walks on them are transient. Define the Green functions 


(oe) 
Gr(e,y) = >) pM, yr", 2, y eT, 
n=0 


for all r < R := 1/p. It is proved in [9] that for finite-range random 
walks on hyperbolic groups Ancona’s Inequalities hold up to the radius 
of convergence: there exists some C' > 0 such that for any z,z € TP and 
for any y on a geodesic segment from x to z we have 


G,(x,z) < CG,(z,y)Gr(y,2z) VWre [1, RI. 
Symmetry of the random walk implies, see Lemma 2.1 in [10], that 


lim Gp(e,r)=0. 


d(e,xz)—>0o 


Eventually we obtain, see Lemma 2.2 in [10], that the Green functions 
of the random walk decay exponentially, that is, there exist some 
constants C, and g <1 such that for all z,y eT 


Gr(a,y) = Cre?™. (8.2) 


In the case of planar Cayley graphs, the Cayley 2-complex is the 2- 
complex such that the one-skeleton is given by the Cayley graph G and 
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the 2-cells are bounded by loops in G. The 2-complex is homeomorphic 
to the hyperbolic disc and it can be endowed with an orientation. 
This orientation is used implicitly at several points in the proof of 
Proposition 3.2. Moreover, the Gromov hyperbolic boundary 0G can be 
identified with the unit circle. 


3. Proof of Theorem 1.4 


The following result holds true for traces of transient BRWs on Cayley 
graphs (or even URGs). Its proof is an adaptation of the one for invariant 
percolation, see Proposition 8.33 in [12]. As the arguments are short we 
can present the details. We stick as close as possible to the notations 
in [12]. 


Proposition 3.1. Consider the trace of a transient symmetric UBRW 
on a Cayley graph. Almost surely every trace that has at least three ends 
has no isolated end. 


Proof. For each n € N let Ap be the union of all vertex sets A C Tr 
such that the diameter diam(A) < n (in the metric in Tr) and Tr\A 
has at least three infinite components. If Tr has at least three ends then 
An # % for all but finitely many n. We assume from now on that Tr has 
at least three ends. 

Let be n EN. For any vertex x in Tr let 


C(x) = {ty € An: d(x, y) = min d(x, z)} 
z n 
be the set of vertices in Ay that are closest to x in the metric in Tr. We 
define the Borel function F' : Gx —> [0, oo]: 


1 . 
Ta ltyec@), if An #9, 


F(Tr,2,y) = 
‘ ¥) | 0, otherwise. 


The function F is well defined since F(Tr,o,x) is invariant under 
isomorphisms. Given that the law v of the rooted trace [Tr, o] is 
unimodular we can apply the MTP to obtain that the expected mass 
received by o is at most 1: 


i \> F(tr, x, 0)dv([Tr, o]) = )> F(tr, 0, «)dv([Tr, ol) <1. 


veTr wveTr 


Assume that & is an isolated end of Tr and let us show that this leads 
to a contradiction. We describe € by a geodesic ray y = (0, 21, 22,...) 
starting at o. There exists some finite set Fo = {y € Tr | d(o, y) < Ro} 
for some Ro € N such that there is a connected component Uo of G \ Fo 
whose completion contains only € on the boundary. Choose now n € N 
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large enough such that there exists some z € y with  € Up N An. If 
we choose R € N large enough and set F = {y € Tr | d(o,y) < R} 
then the connected component U of G\F whose completion contains 
only € on the boundary satisfies UM A, = 9. Now we set B = FO 
Up which is finite. This provides that all paths from U to all other 
connected components of G\F must pass through B. As for all y € 
Un y we have that d(y,z) < d(y, Fo) we see that a finite subset of 
vertices of B gets all the mass from all the vertices in UN y. As UN y 
contains infinitely many vertices the set B receives infinite mass. Using 
the property that ‘everything shows at the root’, see [1, Lemma 2.3], we 
obtain that f >> er, F (Tr, 2, 0)dv({Tr, o]) = 00, a contradiction. Hence, 
almost surely every trace with A, 4 @ does not have isolated ends. 
Because this holds for all but finitely many n € N, we obtain that almost 
surely a trace with isolated ends can have at most two ends. 


Proposition 3.2. Under Assumption 1.1 the trace of a transient 
UBRW on a planar hyperbolic Cayley graph has a.s. no isolated end. 


Proof. We start with some preparations. Since the Gromov boundary 
0G can be identified with a circle there exist infinite geodesics y1, yg and 
y3 starting from o with distinct boundary points &,,& and &3. Denote 
by y = Ui (vi, where {y;} is the set of vertices in y;. We define 
dG to be the part of the Gromov boundary that is between & and 
&; and Sj; to be the set of vertices between y; and y;. Let K be some 
large positive constant to be chosen later. Geodesics in hyperbolic groups 
either converge to the same boundary point or diverge exponentially. 
Hence, for K sufficiently large there exist r1 € S12, x2 € S93, and 23 € $31 
such that d(a;,y) = K for all 7 € {1, 2, 3}. 

Let 7 € {1, 2,3} and consider the sphere $(2;, n) of radius n around 27; 
in the Cayley graph metric. Since the y;’s are geodesics and triangles 
are thin we know that there exists some constant C2 such that 
|S(aj,n)Ny| < Con. Now, start a BRW in 2; and denote by Tr; the 
trace of the BRW started in 2;. Using Markov’s Inequality and Inequality 
(8.2) we obtain that 


PlitriN yl = 1) SEliTrsN yi] < D> Gm(ai,y) < D> CiCane”, (8.3) 
yey n>K 


which tends to 0 as K tends to infinity. We note that Gy,(a;, y) is the 
expected number of particles visiting y when starting a BRW with one 
particle at 2;. 

We assume now that P[Tr has an isolated end] = c > 0 and show 
that this yields a contradiction. Inequality (8.3) allows us to choose K 
sufficiently large such that for 7 € {1, 2, 3}, 


P[Tr; has an isolated end, |Tr; N y| = 0] > 0. 
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We start the UBRW in o and condition the UBRW on the event that at 
time N = max{d(o, 2) : 7 € {1, 2, 3}} in each of the vertices x1, x2 and 23 
there is exactly one particle and no particle elsewhere. This is possible 
since we assume that 1 > 0 and q(e) > 0. So what happens at times 
n > N has the same distribution as we start three independent BRWs 
in 21,22 and 23. Eventually, using the planarity of G we get that Tr has 
with positive probability at least three distinct ends including at least 
one isolated end, which yields a contradiction to Proposition 3.1. 


It remains to show that the trace has a.s. continuum many ends. 


Corollary 3.3. Under Assumption 1.1 the trace of a transient UBRW 
on a planar hyperbolic Cayley graph has a.s. continuum many ends. 


Proof. Due to Proposition 3.2 the trace Tr must have infinitely many 
ends because otherwise each end would be isolated. Moreover, each 
infinite connected component of Tr\B(o,n) must contain at least two 
ends; otherwise, such a component would contain an isolated end. Thus, 
the number of ends is at least of order |2|, which proves the claim since 
Tr is of bounded degree. 
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Abstract 

The purpose of this expository and historical article is to revisit the 
notions of amenability and ergodicity, and to point out that they 
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1. Introduction 


The notion of amenability for groups has several historical origins. 
Most authors cite the article [vNeu-29] of von Neumann, sometimes 
complemented by a note [Tars—29] of Tarski, developed in [Tars—38]. 
Von Neumann and Tarski were motivated by the paradox of Banach 
and Tarski [BaTa-24], and its origin in a decisive observation of 
Hausdorff [Haus—14, page 469]. There is a good and comprehensive 
exposition of this subject in [Wago-85]. In these references, groups 
do not have topology; in other words they are just ‘discrete groups’ 
(even if topological groups appear in Wagon’s book, they appear only 
marginally). Amenability is considered explicitely for topological groups 
in later articles [Dixm—50, Fomi-50, Rick-67]. But attention has often 
been restricted to two particular classes of topological groups: discrete 
groups (and semi-groups), as in [HeRo-63, § 18], and locally compact 
groups, as in the influential book of Greenleaf [Gree—69]. 

The first goal of this chapter is to indicate that the notion of 
amenability has at least one other independent origin. Indeed, we were 
surprised to discover that topological groups G, such that there exist 
left-invariant means on C>(G), are already the main subject of the 1939 
article of Bogolyubov [Bogo-39], where appears the class of topological 
groups that we call ‘B-amenable’ in Definition 3.8 below; the ‘B’ refers 
to Bogolyubov. Here, C>(G) stands for the Banach space of bounded 
continuous real-valued functions on G. 

The second goal of this chapter is to expose and survey various aspects 
of amenability as they occur for topological groups that need not be 
locally compact. Results stated (and often proved) below are not original. 


1.1. Plan of the Chapter 


We add some historical comments in Section 2. In Section 3, we review 
the basic notions: amenability and the related fixed point property; 
and also B-amenability; for locally compact groups, the three notions 
are equivalent. Section 4 is about their hereditary properties. Section 
5 contains three examples of amenable topological groups that are not 
locally compact: the unitary group of a separable infinite dimensional 
Hilbert space, the symmetric group Sym(N) of the positive integers, and 
the general linear group GL(V) of a vector space of countable infinite 
dimension over a finite field (with their Polish topologies). Section 6 
provides more examples which have indeed a stronger property: they 
are extremely amenable. In Section 7, we discuss several definitions of 
ergodicity for actions on compact spaces; they agree for second countable 
locally compact groups, but not in general, as shown in Section 8 by 
an example of Kolomogorov involving Sym(N). The final Section 9 
describes a characterization, due to Fomin, of ergodicity in terms of 
unitary representations. 
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It is convenient to agree that all topological spaces and groups 
appearing in this chapter are assumed to be Hausdorff. 


2. Some Historical Comments 


Let G bea topological group and X a non-empty compact space on which 
G acts continuously by homeomorphisms. Denote by P(X) the space 
of probability measures on X, by P&(X) its subspace of G-invariant 
probability measures, and by € G(X) the subspace of indecomposable G- 
invariant probability measures on X (the definitions of indecomposable 
measure, and of related ergodic notions, are recalled in Section 7). 
In the classical case of G the additive group R of real numbers and 
X a metrizable compact space, Bogolyubov and Krylov established in 
[KrBo-37] three fundamental results, that we quote essentially as Fomin 
does in [Fomi-—50, § 2): 


(1) PR(X) isa non-empty convex compact subspace in the appropri- 
ate topological vector space. 

(2) PR(X) is the convex closure of the space ER(X) of ergodic 
invariant probability measures; in particular € R(X) is non-empty. 
(‘Ergodic’ is used today, but ‘transitive’ is used in [KrBo-37, 
Bogo-39, Fomi-50].) 

(3) For every w € €®(X), there exists an R-invariant Borel subset 
E, C X with the following properties: w(£,,) = 1, and p’(E,) = 0 
for all p’ € ER(X), w! Ap. 


See also [Oxto-52] for G = Z (instead of R). 

The generalization from R to other topological groups raises several 
problems and justifies new notions, in particular that of amenable 
groups, for which (1) holds (Proposition 3.6 below). When (1) holds, (2) 
holds without further restriction on G, as it is indicated in [Bogo-39]; 
this follows alternatively from the Krein—-Milman theorem [KrMi-40]. 
Result (3) holds more generally when G is a second countable locally 
compact group, as exposed in [Vara-63] and [GrSc—00, Theorem 1.1(3)], 
see Section 7; but (3) does not hold for all groups, as shown by the 
example of Kolmogorov discussed in Section 8. 

It is likely that Bogolyubov, and perhaps later also Fomin, were not 
aware of von Neumann’s and Tarski’s 1929 articles when they wrote 
[Bogo-39] and [Fomi—50]. Moreover, Bogolyubov’s 1939 article had very 
little impact at the time (see [Anos—94], in particular the bottom of 
Page 10). Indeed, neither [Bogo-39] nor [Fomi-50] appears in the lists 
of references of any of [Dixm-—50], [HeRo-63], [Rick-67], [Gree-69], or 
[Wago-85]. 

Nevertheless, in his 1939 article, Bogolyubov shows that (1) above 
holds for every topological group which is B-amenable, and the same 
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proof shows that this extends to amenable topological groups. In § 2 
of [Fomi-50], Fomin shows that the following classes of groups are 
B-amenable: compact groups (his Theorem 3), groups containing a 
cocompact closed B-amenable subgroup (Theorem 4), abelian groups 
(Theorem 5), and solvable groups (Corollary 2). For Fomin’s work in 
general during the 1950s, see [Ale—76]. 

A possible other origin of amenability could be an article [Ahlf-35] 
on Nevanlinna theory, where Ahlfors defines ‘regularly exhaustible’ open 
Riemann surfaces, i.e. surfaces S with a nested sequence Q1 C Q2 C-::: 
of domains with compact closures and smooth boundaries, such that 


: length, (Qn 
S = Ur Qn, and such that limp oo a = 0, where the 


subscript g refers to some metric in the conformal class defined by 
the complex structure on $. Ahlfors has used such sequences to define 
averaging processes, as Fglner did later with ‘F¢lner sequences’ in groups. 
The notion of regular exhaustion has natural formulations in Riemannian 
geometry; from several possible references on this subject, we will only 
quote [Roe-88]. The connection between Ahlfors’ regular exhaustions 
and amenability seems rather obvious now, but we are not aware of 
any discussion of it in the literature before the 1980s. Apparently, the 
connection could only be observed after amenability was recognized as 
a metric property of both groups and spaces. 

Though we will not discuss it here, we note that the notion of 
amenability has been extended to many other objects than groups, 
including semi-groups, associative algebras, Banach algebras, operator 
algebras (nuclearity, exactness, injectivity), metric spaces, equivalence 
relations, group actions, foliations, and groupoids. 


3. Amenability, Fixed Point Property, 
and B-amenability 


Let X be a topological space. Let C>(X) denote the Banach space of 
bounded continuous real-valued functions on X, with the sup norm 
defined by ||f|| = sup;ex |f(z)| for f € C(X). When X is compact, 
every continuous function on X is bounded, and we rather write C(X) 
for CP(X). 

Let G be a topological group. For a € G and f € C(G), define gf and 
fa by af (g) = f(a-'g) and f,(g) = f(ga~') for all g € G. Observe that, 
for a given f € C>(G), the map G —> C(G), a H> af need not be 
continuous (example with G = R and f(t) = sin(zt?) for all t € R). Let 
CG) denote the subspace of C®(G) of those functions f for which gf 
depends continuously of a, i.e. the space of! bounded right-uniformly 


1 Here is a justification of the word ‘right’. The right-uniform structure on G is 
rightfully the uniform structure with entourages of the form Uy = {(g,h) € Gx G | 
hg! € V}, for some neighbourhood V of 1 in G; this structure is invariant by right 
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continuous functions on G. This is a Banach space, indeed a closed 
subspace of the Banach space C°(G). [We use a—' in the definition of gf, 
so that (a,f) +> af defines a left-action of G on C>(G) and CG): 
some authors use a rather than a~!; this does not change the definition 
of cb (G),] Similarly, the space cP (G) of bounded left-uniformly 
continuous functions on G, i.e. the space of those f € C>(G) for 
which f, depends continuously on a, is a closed subspace of CPG): We 
denote by Ce the intersection ClrG) nN cb (G). 

Let F be a linear subspace of C b(G) containing the constant functions. 
A mean on F is a linear form M on E that is positive, i.e. M(f) > 0 
whenever f(g) > 0 for all g € G, and normalized, i.e. M(1) = 1, 
where 1 denotes both the number 1 € R and the corresponding constant 
function on G. Equivalently, a mean is a linear form on F such that 


inf f(g) < M(f) < supf(g) V fee. 

geG geG 
Observe that a mean M on F is bounded of norm 1. Assume moreover 
that £ is such that af is in F for all f € EF and ae G. A mean M on 
F is left-invariant if M(gf) = M(f) for all f € F and ae G. Right- 
invariant means are defined similarly. Assume that F is such that both 
af and fy are in EF for all f € EF and a € G; a mean on F is bi-invariant 
if it is both left-invariant and right-invariant. 


Proposition 3.1. For a topological group G, the following properties 
are equivalent: 


(i) there exists a left-invariant mean on rele (G), 
(ii) there exists a right-invariant mean on Cp, (G), 


and they imply 
(iit) there exists a bi-invariant mean on C?(G). 


Proof. For a function f defined on G, denote by fY the function g +> 
f(g7'). Observe that (fY)Y = f, and that, for f € C>(G), we have 
f €C>,(G) if and only if fY € CP,(G). 


multiplications. Hence a function f : G —> R is right-uniformly continuous if, 
for all ¢ > 0, there exists a neighbourhood V of 1 in G such that 
ghee Uy => IfW-fMl<e, 
i.e. aeV => I|flag)-—fi@|<e V geEG, 
i.e. aeVv => laf —-fill <e. 
This is why we use ‘right’ here, as, for example, in [Rick—67], even though, in the 


last inequality, a appears on the left of f; other authors use ‘left’ at the same place 
[Zimm-—84, page 136]. 
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If there exists a left-invariant mean M on OP EG): then f +> M(fY) 
is a right-invariant mean on cP (@). Hence (i) implies (ii). Similarly, (ii) 
implies (i). 

Assume that there exist a left-invariant mean My on GPG) and a 
right-invariant mean M; on CP (G). For f € CO); define first Fy : 
G —> R by Fy(a) = Me(fa); then Fy € GPG): define now a linear 
form M on Ce by M(f) = Mr(F). Then M is a bi-invariant mean 


on C>(G). This shows that (i) or/and (ii) implies (iii). 


We do not know whether (iii) implies (i) and (ii). It does for locally 
compact groups: see ‘References for the proof’ after Proposition 3.10. 


Definition 3.2. A topological group G is amenable if it has 
Properties (i) and (ii) of the previous proposition. 


We write LCTVS as a shorthand for ‘locally convex topological vector 
space’, here on the field of real numbers. Let C' be a convex subspace of 
an LCTVS; a transformation g of C is affine if 


g(cz + (1—c)y) = g(t) + A — c)g(y) 


for all x,y € C and c € R with 0 < c < 1. An action of a topological 
group G on C is continuous if the corresponding map G x C —> C is 
continuous, and affine if z +> g(z) is affine for all g € G. Group actions 
below are actions from the left, unless explicitly written otherwise. 


Definition 3.3. A topological group G has the fixed point property 
(FP) if every continuous affine action of G on a non-empty compact 
convex set in an LCTVS has a fixed point. 


Definition 3.3 plays a fundamental role in a work of Furstenberg 
[Furs—63]. 

It is straightforward to check that compact groups have Property (FP). 
For abelian groups, the following fixed-point theorem appeared first in 
[Mark-—36] and [Kaku-38]; a convenient reference is [Edwa-65, Theorem 
Soh) 


Theorem 3.4 (Markov—Kakutani theorem). Every abelian group has 
Property (FP). 


The result carries over from abelian groups to solvable groups, 
by Proposition 4.1(3) below, and from solvable groups to topological 
solvable groups, obviously. Therefore, we have also: 


Corollary 3.5. Every solvable topological group has Property (FP). 


Proposition 3.6. For a topological group G, the following three 
properties are equivalent: 


(i) G has Property (FP); 
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(ii) G is amenable; 
(iti) for every non-empty compact space X and every continuous action 
of G on X, there exists a G-invariant probability measure on X. 


Before the proof, we recall two standard facts from functional analysis. 

Unit Balls in Duals of Banach Spaces. For a real Banach space FE, we 
denote by E* its Banach space dual, and Ef the unit ball in E*. On 
E* and Ey, we consider also the weak-*-topology, for which E* is an 
LCTVS and E} a compact space (this is the Banach—Alaoglu theorem). 

Barycentres. Let C be a non-empty compact convex subspace of an 
LCTVS E, and let uu be a probability measure on C’. Then there exists 
a unique point 6, € E such that f(by) = Sof @)du(2) for every 
continuous linear form f on £; moreover 6, € C’. The point 6, is called 
the barycentre of yz, or the resultant of w, and w represents b,,. Recall 
the following formulation of the Krein—Milman theorem: every point of 
C is the barycentre of a probability measure on C’, which is supported 
by the closure of the set of extreme points of C; see [Phel-66, Section 1]. 


Proof. (i) > (ii) The set Mean(G) of all means on C>,(G) is a subspace 
of the unit ball cb (Gyr, and is closed for the weak-*-topology, so that 
Mean(G) is a compact convex set. Moreover the natural action of G 
on Mean(G) is affine and continuous (in the sense recalled just before 
Definition 3.3). 

If G has Property (i), there exists a G-invariant probability measure 
jw on Mean(G). Such a measure has a barycentre M € Mean(G). Since 
M is G-invariant (by uniqueness of the barycentre), G has Property (ii). 

[It is even more straightforward to check that (i) implies (iii), because 
if G acts on X, then G acts on the space P(X) of probability measures 
on X, that is naturally a compact convex set.] 

(ii) => (iii) We reformulate the argument of [Bogo-39], written there 
for C(G); the same argument applies equally well to CPG), as can be 
read for example in [Rick-67]. 

Let v be a probability measure on X. For f € C(X), define a function 


Fy : G—R, gr [ f(gz)dv(a). 
xX 


Then Ff is bounded and right-uniformly continuous on G. Let moreover 
aeé G. Then 


Fu p(9) = [fete ave peia=lgy = GERD\@) 


for all g € G. 
b 


Let M be left-invariant mean on C,,,(G). Define a linear form 


ww: C(X)—>R, fro MCF). 
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Then yw is a normalized positive linear form on C(X), i.e. w can be seen 
as a probability measure on X. Since M is left-invariant, we have 


Mal) = M(Fu pf) = M(a(Fp)) = MEP) = 2) 


for all f € C(X) and a € G, i.e. the measure pu is G-invariant. 

(iii) = (i) Let C be a compact convex set on which G acts 
continuously, by affine transformations. If G has Property (iii), there 
exists a G-invariant probability measure yz on C’. The barycentre b, € C 
of yw is fixed by G. 


Remark 3.7. (1) The equivalence of (i) and (ii) appears in many 
places, for example in [Eyma—75], who quotes [Day—61] (as much as 
we know the first article where the arguments are found, but applied to 
abstract groups only) and Rickert (as much as we know the first author 
who applies the arguments to topological groups); see more precisely 
[Rick-67, Theorem 4.2]. 

(2) For a direct proof of (ii) = (i), see also [BeHV—08, Theorem G.1.7]. 

(3) There is a natural embedding G —> Cone, gro (FR F(g)), 
which is continuous for the weak-*-topology on the range (recall that 
the subscript 1 stands for ‘unit ball’). By definition, the universal 
equivariant compactification y,,G of G is the closure of the image of 
this embedding, and the natural action of G on y,G is continuous. The 
properties of Proposition 3.6 are moreover equivalent to 


(iv) the natural action of G on its universal equivariant compactifica- 
tion vy,G has an invariant probability measure, 


as it is shown in [BaBo-1]]. 
(4) Finally, Properties (i) to (iv) are equivalent to 


(v) every continuous action of G on the Hilbert cube has an invariant 
probability measure, 


as has been established in [AIMP-11], and previously in [BoFe—07] for 
countable groups. For a countable group I’, amenability is also equivalent 
to the following property [GiHa-97]: 


(vi) every action by homeomorphisms of FP on the Cantor middle-third 
space has an invariant probability measure. 


The end of this section is devoted to a notion stronger than 
amenability. It was introduced by Bogolyubov [Bogo-39], without a 
name, and appears in [Rick—67, Definition 2.1], unfortunately with a 
name used most often later for the notion of our Definition 3.2; we call it 
B-amenability (Definition 3.8). Historically, B-amenability came before 
amenability for topological groups. Amenability, equivalent to Property 
(FP) of Definition 3.3, has been so far more often useful than 
B-amenability, and deserves the shorter name. (See also Remark 3.15.) 
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Definition 3.8. A topological group G is B-amenable if there exists 
a left-invariant mean on C?(G). 


Proposition 3.9. A B-amenable topological group is amenable. 


Proof. If G is a topological group, the space cGy is a G-invariant 
subspace of COC: and the restriction to this subspace of a G-invariant 
mean on C>(G) is a G-invariant mean on ch(G), 


Propositions 3.10 and 5.2 establish that the converse holds for locally 
compact groups, but not in general. 


Proposition 3.10. Let G be a locally compact group. Then G is 
B-amenable if and only if G is amenable. 


Reference for the Proof. See [Gree-69, Theorem 2.2.1 page 26, and 
page 29]. Greenleaf shows there that the following properties of a locally 
compact group G are equivalent: existence of a left-invariant mean on 
each of 


(i) the space L(G) of essentially bounded Borel measurable func- 
tions (modulo equality in complements of locally null sets), 
(ii) the space C>(G), 
(iii) the space C>,(G), 
(iv) the space C>(G). 


Moreover, Properties (i) to (iv) are equivalent to 
(*) there exists a bi-invariant mean on £, 


for E any one of the spaces in (i) to (iv) above. 

Indeed, a substantial part of the early theory of amenability on locally 
compact groups is to show that infinitely many definitions are equivalent 
with each other. 


Example 3.11. Compact groups are B-amenable, since the normalized 
Haar measure on a compact group G provides a mean on C>(G) that is 
both left- and right-invariant. 


Proposition 3.12. For a topological group G, the following two 
properties are equivalent: 


(i) G is B-amenable, 
(ii) for alin = 1, f,...,f™ | C>(@, a,...,an € G, andt eR, 
such that f) — qfO +---+f™ —4,f™ = t, we have t < 0. 


On the Proof. The non-trivial implication is (ii) > (i). It is a conse- 
quence of the Hahn—Banach theorem; see [Dixm—50, Théoréme 1]. 


This has the following consequence, for which we refer again to Dixmier 
[Dixm-50, Théoréme 2(a)]: 
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Proposition 3.13. Abelian topological groups are B-amenable. 


By Proposition 4.3, we have also the following corollary (compare 
with 3.5): 


Corollary 3.14. Solvable topological groups are B-amenable. 


Remark 3.15 (on terminology). In a first version of the present article, 
we used ‘strongly amenable’ for ‘B-amenable’. But this was unfortunate, 
because the terminology is already used for groups of which the universal 
minimal proximal flow is trivial, and this is a different notion, as there 
is an example of Furstenberg which shows that a solvable group need 
not be strongly amenable [Glas—76, page 28]. Strong amenability plays 
its role in more recent work [MeNT-16, Section 4]. 

The terminology ‘u-amenable’ and ‘amenable’ is used in [Harp—73], 
instead of ‘amenable’ and ‘B-amenable’ here. 


4. Hereditary Properties 


In most of this section, we address topological groups in general. 
However, groups are assumed to be locally compact in Proposition 4.4, 
and metrizable in Corollary 4.6. 

A topological group G is the directed union of a family (Hy)ge4 of 
closed subgroups of G if the following conditions hold: (1) G = Uye4 Aa; 
(2) for every a, B € A, there exists y € A such that Hy U Hg C Hy; (3) G 
has the topology of the inductive limit of the Hy, ’s. [Note that the index 
set A is a directed set for the preorder defined by a < B if Hy C Hg.] 


Proposition 4.1 (on amenability). Let G be a topological group. 


(1) If G is amenable, then every open subgroup of G is amenable. 

(2) If G is a directed union of a family (Ha)acA of closed subgroups, 

and if each Hy is amenable, then G is amenable. 

(3) If G has an amenable closed normal subgroup N such that the 

quotient G/N is amenable, then G is amenable. 

(4) Let H be a topological group such that there exists a continuous 

homomorphism H —> G with dense image; if H is amenable, 

then so is G. 

(5) If H is a dense subgroup of G, then G is amenable if and only if 
H, endowed with the induced topology, is amenable. 


On the Proof. For each claim, there is at least one property of Propo- 
sition 3.6 that makes the proof rather straightforward. As a sample, we 
check the last claim, and refer to [Rick—-67, Section 4] for (1) to (4). 

(5) Because right-uniformly continuous functions can be extended 
from H to G, the restriction of functions provides an isomorphism 
of Banach spaces COL — ch A ), by which these spaces can be 
identified; let us denote it (them) by CP. 
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On the one hand, a left-G-invariant mean on CP, is obviously a left-H- 
invariant mean on C>,. On the other hand, since the action of G on CP, is 
continuous, a left-H-invariant mean on this space is also left-G-invariant. 
Claim (5) follows. 


Remark 4.2. (1) In the first claim of Proposition 4.1, ‘open’ cannot be 
replaced by ‘closed’ (see Corollary 5.4). See, however, Propositions 4.4 
and 4.5. 

(2) Let G be a group with two Hausdorff topologies Ts, 7 such that 
G, :=(G,7,;) and Gy := (G, Ty) are topological groups, and 7, stronger 
than Jy. If Gs is amenable, then Gy is amenable, as it follows from Claim 
(4) applied to the continuous identity homomorphism id: Gs; —> Gy. 

Suppose, for example, that 7; is the discrete topology and that G; is 
amenable; this is the case if G is abelian, or more generally solvable, by 
Theorem 3.4. Then Gy is amenable for every topology 7, making G a 
topological group. 

(3) The proof of Claim (5) cannot be adapted to Proposition 4.3. 
Indeed the analogue of Claim (5) does not hold for B-amenability 
(Remark 5.3(1)). 

(4) The following result of Calvin C. Moore (1979) is reminiscent of 
Claim (5), but the proof uses completely different notions. Let G be the 
group of real points of an R-algebraic group, H an amenable topological 
group, and g: H —> G a continuous homomorphism; if H is amenable 
and y(H) Zariski-dense in G, then G is amenable. We refer to [Zimm-—84, 
Theorem 4.1.15]. 


Proposition 4.3 (on B-amenability). Let G be a topological group. 


(1) If G is B-amenable, then every open subgroup of G is B-amenable. 

(2) If G is a directed union of a family (Ha)weA of closed subgroups, 
and if each Hy is B-amenable, then G is B-amenable. 

(3) If G has a B-amenable closed normal subgroup N such that the 
quotient G/N is B-amenable, then G is B-amenable. 

(4) Let H be a topological group such that there exists a continuous 
epimorphism H — G; if H is B-amenable, then so is G. 


Reference for the Proof. As for Proposition 4.1, proofs are straightfor- 
ward. In [Rick-67], see respectively Theorems 3.2, 2.4, 2.6, and 2.2. There 
are related results in the older article by Dixmier [Dixm—50]. 


Amenability of closed subgroups of amenable groups is more subtle. 
Let us first recall the following result in the classical setting of locally 
compact groups. 


Proposition 4.4. Let G be a locally compact group and H a closed 
subgroup. If G is amenable, then so is H. 


On the Proof. We mention here two proofs of this statement. 
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The proof of [Zimm-—84, Proposition 4.2.20] uses induction from H to 
G for actions on compact convex sets. Since Haar measure is an essential 
ingredient of induction, this cannot be used for groups that are not 
locally compact. 

The proof of [Gree-69, Theorem 2.3.2] has three steps. We denote by 
H\G the space of H-cosets of the form Hg; we could use G/H instead, 
but this would impose on us right-invariant means on C>(.), rather than 
left-invariant means. 

For the first step, G is assumed to be second countable. Then there 
exists a Borel transversal T for H\G, so that the multiplication map 
Hx T — G, (h,t) +> At, is a Borel isomorphism. This can be used 
first to extend functions on H to functions on G, and then to show that 
amenability is inherited from G to H (we refer to Greenleaf’s book for 
details). For the second step, G is assumed to be o-compact. Then G 
has a compact normal subgroup N such that G/N is second countable 
(Kakutani-Kodaira theorem [KaKo-44]). The first step and Proposition 
4.1(3) imply again that amenability is inherited from G to H. The final 
step makes use of the fact that every locally compact group contains an 
open o-compact subgroup. 

In Rickert’s article, the proposition is proved with additional hypoth- 
esis only, essentially that the quotient of G by its connected component 
is a compact group [Rick-67, Section 7]. 


Corollary 5.4 below shows that Proposition 4.4 does not carry over to 
arbitrary topological groups. In particular this proposition is unlikely to 
have a completely straightforward proof. 

However, with appropriate extra hypothesis, B-amenability is inher- 
ited by closed subgroups. This is shown by the following proposition, 
which is Theorem 3.4 in [Rick-67]. 


Proposition 4.5. Let G be a topological group, and H a closed 
subgroup. Assume that H\G is paracompact and the fibration x : G — 
A\G is locally trivial. 

If G is B-amenable, then so is H. 


Proof. Because G — H\G is locally trivial, there exist an open cover 
(U,)c7 of H\G and a family of continuous sections (o, : U, — G)e] 
such that wo,(z) = « for allv € J and « € U,. Given that H\G is 
paracompact, there is a partition of unity (¢,),<7 subordinate to (U,),er- 
For f € C>(H), define Fy : G —> R by 


Fr(g) = Y> gilx(g)) f (sox) 


vel 


The function Fr is well-defined, continuous because the family of the 
supports of the g, ’s is locally finite, and obviously bounded. The 
assignment f +— F'r is a linear map from Ch(H ) to Cre, that respects 
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positivity and constant functions. Moreover, for f € CP(H) and ae H, 
we have 


alFf) = Fup): 


Indeed 
(Fp) (9) = Yo eer(a'g)) f (a1 g(ox(a-"g))-*) 
vel 
= Vac@@ f(a'goxa'g)) = (Fa) 
vel 
for allg € G. 


Suppose that there exists a left-G-invariant mean M on C(G). The 
assignment m : f +— M(Ff) is obviously a mean on C>(H). Since M is 
invariant, we have 


Maf) = MFup) = MalFs)) = MFP) = mF) 


for all f € C(H) and a € H, i.e. m is a left-H-invariant mean on 
c>(H). 


Corollary 4.6. Let G be a metrizable topological group. If G contains 
a non-amenable discrete subgroup T, then G is not B-amenable. 


Proof. Since G is metrizable, so is [\G, hence ['\G is paracompact. 
Let p : G —+ I\G denote the canonical projection. Let V be a 
neighbourhood of 1 in G such that V-! = V and TN V? = {1}. For 
every g € G, the open subsets y Vg, for y € I, are pairwise disjoint; it 
follows that p'(p(Vg)) is homeomorphic to the product T x Vg, and 
therefore that the fibration p is locally trivial. 
Hence G is not B-amenable by Proposition 4.5. 


Concerning the hypothesis of Corollary 4.6, let us recall a theorem due 
to Birkhoff and Kakutani; for a topological group G, the three following 
conditions are equivalent: 


(i) G is metrizable as a topological space; 
(ii) the topology of G can be defined by a left-invariant metric; 
(iii) G has a countable basis of neighbourhoods of 1. 


See, for example, [BTG5-10, § 3, no 1, pages IX.23-24], or [CoHa-16]. 


Example 4.7. In Proposition 4.5, the property of local triviality is not 
automatic, as the following example, from [Karu—58], shows. 

Consider the circle group T = R/Z, its subgroup C of order 2, the 
compact group G = TN, and the closed subgroup H = CN of G. 
Then G is locally connected, as a product of (locally) connected groups, 
while H and H x (H\G) are not locally connected (indeed H is 
totally disconnected). It follows that G and H x (H\G) are not locally 
homeomorphic, and in particular that the projection G — H\G is not 
locally trivial. 
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5. Examples 


Propositions 5.2, 5.5, and 5.6 provide examples of topological groups 
that are amenable and are not B-amenable. By necessity, these groups 
are not locally compact (Proposition 3.10). 

Let H be an infinite dimensional complex Hilbert space. We denote 
by U(H)str its unitary group, endowed with the strong topology (equiv- 
alently: with the weak topology); as is well known, this is a topological 
group, and it is not locally compact. For the strong and weak topologies 
on sets of operators, and their properties, see e.g. [Dixm—57]. If H is 
separable, U(H)str is a Polish group, i.e. it is separable and its topology 
can be defined by a complete metric. As a curiosity, we note that, for a 
separable Hilbert space H, the strong topology is the unique topology 
for which U(H) is a Polish group [AtKa-12]. 


Lemma 5.1. Let H be an infinite dimensional separable complex 
Hilbert space, and U(H) sir its unitary group. 

Every countable group T° is isomorphic to a discrete subgroup of 
U(A) str. 


Proof. Suppose first that IP is an infinite group. We can identify H with 
the Hilbert space €2([) of complex-valued functions € on I such that 
Dyer l&()? < 00. Let 


4 EF Ss U2 ste 


be the left-regular representation of I’, defined by (A(y)é)(y’) = E(v! y’) 
for all y,y’ €T and & € £2(L). 

For y €f, let 6, € £2(T) denote the unit vector defined by 6,(v) =1 
and 6,(y’) = 0 if y’ # y; observe that A(y)d1 = 4,. Set 


Uy = {ge VOEMD)str | Ig — AO) < ¥2/2I 


On the one hand, U,, is open in U(l2(L))str and A(y) € U,. On the other 
hand, since ||6, — 6,/|| = J2fory,y’eT,y #y’, the U, ’s are pairwise 
disjoint. Hence A(T’) is a discrete subgroup of U(l2(L))str- 

If T is a finite group, we can apply the previous argument to the direct 
product Tl x Z, and use the embedding [ ~T x {0} C Z. 


Note. More generally, for every locally compact group G, the left- 
regular representation G — U (L?(G))str is both a continuous homo- 
morphism and a homeomorphism of G onto a closed subgroup of 
U(L?(G))str [BeHV—08, Exercise G.6.4]. 

In general, L?(G) need not be separable. It is when G is separable. 


Proposition 5.2 ({Harp—73]). Let H be an infinite dimensional sepa- 
rable complex Hilbert space. The topological group U(H) str is amenable 
and is not B-amenable. 
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Remark. Proposition 6.3 establishes that U(H)gstr has a property much 
stronger than amenability. 


Proof. Let (€n)n>1 be an orthonormal basis of H. For each n > 1, we 
identify the compact Lie group U(n) to the subgroup of U(H)str of those 
unitary operators leaving invariant the linear span Vy, of {e1,..., en} 
and coinciding with the identity on the orthogonal complement V;-. Let 
U(co) denote the union of the compact groups in the nested sequence 


Ud) c---cC UM) CUMmM+I1)c::-, 


with the inductive limit topology. The group U(oc) is dense in U(H)str. 

The later claim follows from Kaplansky’s density theorem, essentially 
the version of [Pede-79, Theorem 2.3.3]. Pedersen formulates his The- 
orem 2.3.3 in terms of a C*-subalgebra A of L(H). More generally, his 
proof applies without change to an involutive subalgebra A of L(H) such 
that exp(z) € A for all z € A. In our case, A is the algebra spanned by 
the identity operator and J,,.; Mn, where M,, stands for the the finite- 
dimensional algebra M, = {x € L(H) | (Vn) C Vn and a(V,") = {O}}. 

For the inductive limit topology, the group U(co) is a topological 
group. It is not locally compact; indeed, it is not a Baire group, because 
U(n) has empty interior in U(co) for all n. (For other topological 
properties of U(oo), see e.g. [Hans—71, Theorem 4.8]). Since the compact 
groups U(n) are amenable, so is U(oo) by Proposition 4.1(2). It follows 
from Proposition 4.1(5) that U(H)str is amenable. 

By Lemma 5.1, the group U(H)str contains non-amenable discrete 
subgroups. It follows from Corollary 4.6 that U(H)str is not B-amenable. 

The proof of the proposition is complete. Let us, however, reproduce 
the argument of [Harp—73], that is a different proof that U(H)str is not 
B-amenable. 

Suppose ab absurdo that there exists a left-invariant mean M on the 
space C>(U(H)str). Let &€ be a unit vector in H. For every bounded 
operator S on H, the function 


UMW str — R 
g > Re((g7!Sg6 | €)) 


is bounded and continuous. Let L(H) denote the algebra of all bounded 
operators on H. Observe that, for h € U(H)str, we have 


nfse(g) = Re ((g7thSh™"g& | €)) 


for all gE U(H)str, 1.e. ad'S,é = Shsh-1,8: 
Consider the linear form 


(LG: 3? oR 
aa) |e ee oe cars) 


fs,g : 
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Because M is left-invariant, we have, for all S € L(H) and h € U(N)str, 


t(hSh-*) = M(fngnae) = Mts) = MCs) = te(S), 


and therefore also t;(Sh) = tz (hS). 

Every operator in L(H)str is a linear combination of unitaries.2 Hence 
te(ST’) = te(TS) for all S, T € L(H). Because the identity operator is 
a sum of two commutators, i.e. since idy is of the form 51,7), — TS, + 
52 T2 — T259 (see e.g. Problem 186 in [Halm-67]), we have t¢(idy) = 0. 
But this is preposterous, because Tz (idz,) = M(1) = 1. Hence U(H)str is 
not B-amenable. 


Remark 5.3. (1) Let the notation be as in the proof above. Because the 
group U(oo) is B-amenable (by Claim (2) of Proposition 4.3) and dense 
in U(H)str, Proposition 5.2 justifies Remark 4.2(3). 

(2) Proposition 5.2 has the following offspring. Let M be a von 
Neumann algebra, realized as a weakly closed x*-subalgebra of L(H), 
for some separable Hilbert space H. Let U(M)str be its unitary group, 
with the strong topology. Then U(M)str is amenable if and only 
if M is injective [Harp-79]. For a C*-algebra A, there is a similar 
characterization of nuclearity of A in terms of amenability of the unitary 
group U(A) of A, with the norm topology [Pate—92]. 

(3) Consider the Banach algebra L(H) of all bounded operators on H, 
with the usual operator norm. Let GL(H)norm be its general linear group, 
with its topology as an open subset of L(H). We denote by U(H)norm the 
unitary group of H, with the topology induced by the operator norm; it is 
a closed subgroup of GL(H)norm. The groups U(H) norm and GL(H) norm 
are not amenable. Also, the group GL(£”)norm is not amenable, for p 
with 1 < p < ov. For this (and more), see [Harp-73] and [Pest—06, 
Examples 3.6.13-15). 


From the proof of Proposition 5.2, let us extract the following 
observation. It appeared in [Harp-82, page 489]. 


Corollary 5.4. A closed subgroup of an amenable topological group 
need not be amenable. 


Let Sym(N) denote the full symmetric group of the positive integers, 
with its standard Polish topology (any infinite countable set would do 
instead of N). There is a curiosity similar to that noted just before 
Lemma 5.1: the group Sym(N) has a unique topology making it a 
Polish group [Gaug-67, Kall-79], indeed a unique topology making it 


2 Let A be a C*-algebra with unit. Every x € A is a linear combination of four 
unitaries. Indeed, since « = 4(a + x*) 4 a (ix — ix*), it is enough to check that a 
self-adjoint element of norm at most 1 is a linear combination of two unitaries. If 
x* = x and |\z|| < 1, then u=a2+iV1— <2? and u* = ¢—iV1-— 2? are unitary, and 
2£= d(u +u*). 
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a separable Hausdorff group [KeRo-07, Theorem 1.11]. We denote by 
Symy(N) the subgroup of Sym(N) of permutations with finite support; 
it is a locally finite group, dense in Sym(N). 

As in Proposition 5.2, we have: 


Proposition 5.5. The topological group Sym(N) is amenable and is 
not B-amenable. 


Proof. The proof is analogous to that of Proposition 5.2. On the one 
hand, since Sym(N) contains a dense subgroup that is locally finite 
and therefore amenable, Sym(N) is amenable by Proposition 4.1. On 
the other hand, N can be identified (as a set) with a non-amenable 
countable group I’, and the metrizable group Sym(I) contains [as a 
discrete closed subgroup, so that Sym(N) ~ Sym(1) is not B-amenable, 
by Corollary 4.6. 


Let V be a vector space over a finite field GF(q). Assume that the 
dimension of V is infinite and countable. Observe that V is a countable 
infinite set, and that the general linear group GL(V) is a subgroup of 
Sym(V) ~ Sym(N). 

Moreover, each of the equations 


g(0) =0, with 0 the origin of V, 
gv) =Ag(v), with A € GF(q)* and ve V, 
guvu+w)=g(v)+g(w), with v,we V, 


defines a closed subset of Sym(V). Hence GL(V) is a closed subgroup 
of Sym(V); in particular GL(V) itself is a Polish group. 


Proposition 5.6. The topological group GL(V) ts amenable and is not 
B-amenable. 


Proof. This is one more variation on the same proof, as for Propositions 
5.2 and 5.5. Let (en)nen be a basis of V. For every n € N, denote by Vy 
the linear span of {e1, €2,..., én}, and by GLy the subgroup of GL(V) 
consisting of those elements g such that g(Vn) = Vn and g(ex) = ex for 
all k > n. We have a nested sequence 


[o,@) 
GL1 C++» C GLn C GLn C++ C GLy = |) GLn 
n=1 


of which the union GLy is locally finite, in particular amenable, and 
dense in GL(V). Hence GL(V) is amenable. 

The space V has a basis (e,),cr indexed by a non-abelian free 
group F’. Each yo € F can be viewed as an element of GL(V) mapping 
ey tO €yy for all y ¢ F. This shows that GL(V) contains a discrete 
closed subgroup isomorphic to F’, and in particular that GL(V) is not 
B-amenable. 
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Example 5.7. Let k be a commutative ring, with unit. Denote by 
J(k) the substitution group of formal power series over k, with elements 
of the form f(z) = 4+ > j.9 a;z*, where a; € k, and with substitution 
for the group law. For what follows, and much more, about groups of 
this kind, see [Babe—13]. 

For each n > 1, denote by 7”*+!(k) the normal subgroup of 7 (k) 
defined by the equations ag = a3 = --- = Gn41 = O and by Jn(k) 
the quotient 7(k)/7 n+1(k), There is a natural bijection between 7; (k) 
and k”. When k is a topological ring, we use this bijection to define a 
topology on Jn (k); it is a group topology. Then 7 (k) is also a topological 
group, with the topology of the inverse limit lim, Ink). This topology 
is interesting even if the ring k is discrete; note that the group 7 (k) is 
profinite when the ring k is finite. 

It is known that, for every topological commutative ring k, the group 
J(k) is amenable [BaBo-11]. According to [Babe-13, page 61], it is not 
known whether the group /(Z) is B-amenable. 


6. Extreme Amenability 


Several of the examples we know of topological groups that are amenable 
and not locally compact have a property stronger than amenability: 
that of extreme amenability. The notion appeared in the mid 1960s. 
The subject became more important with later articles, such as those 
of Gromov—Milman [GrMi-83], written in the late 1970s, and Kechris— 
Pestov-Todorcevic [KePT-—05]. For indications on the development of 
the subject, see the introduction of [Pest—06]. 


Definition 6.1. A topological group G is extremely amenable if 
every continuous action of G on a compact space has a fixed point. 


In the next proposition, Items (1) to (4) are reminiscent of similar 
items in Propositions 4.1 and 4.3. 


Proposition 6.2 (hereditary properties of extreme amenability). Let 
G be a topological group. 


(0) If G is extremely amenable, then G is amenable. 

(1) If G is extremely amenable, every open subgroup of G is extremely 

amenable. 

(2) If G is a directed union of a family (Ha)acA of closed subgroups, 

and if each Hy is extremely amenable, then G is extremely 

amenable. 

(3) If G has an extremely amenable closed normal subgroup N such 
that the quotient G/N is extremely amenable, then G is extremely 
amenable. 
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(4) Let H be a topological group such that there exists a continuous 
homomorphism H —> G with dense image; if H is extremely 
amenable, then so is G. 

(5) A closed subgroup of an extremely amenable group NEED NOT be 
amenable. 


Proof. Claims (0) and (4) are straightforward. Claim (1) appears in 
[BoPT-11, Lemma 13]. 

(2) Let X be a non-empty compact G-space. For each a € A, the set 
X* of H,-fixed points is closed and therefore compact in X, and non- 
empty by hypothesis on Hy. The intersection (\y~4 X% is non-empty, 
by compactness of X. Because this intersection coincides with the set of 
G-fixed points in X, the group G is extremely amenable. 

For (3), see [Pest—06, Corollary 6.2.10]. 

A smart example confirming (5) is the group Aut(Q) of order- 
preserving permutations of the set Q of rational numbers (see next propo- 
sition), which contains a non-abelian free discrete subgroup [Pest—98, 
Theorem 8.1]. Note that a non-trivial locally compact closed subgroup of 
an extremely amenable group is not extremely amenable, see Proposition 
6.5 below. 


Proposition 6.3. The following groups are extremely amenable. 


(1) The unitary group U(H)str of an infinite dimensional separable 
complex Hilbert space H, with the strong topology. 

(2) The group Aut(Q) of order-preserving permutations of Q, with the 
topology of pointwise convergence on the discrete space Q. 

(3) The group H*(\0,1]) of orientation-preserving homeomorphisms 
of the closed unit interval, with the compact-open topology. 

(4) The isometry group of the Urysohn space. 

(5) The group Aut(X, w) of all measure-preserving automorphisms of 
a standard non-atomic finite or infinite and sigma-finite measure 
space, with the weak topology. 

(6) The group L°(X,B, w; G) of all measurable maps from a Lebesgue 
space with a non-atomic probability measure (X , B, ) to a second- 
countable compact group G, up to equality w-almost everywhere, 
with the topology of convergence in measure. 


References for the Proof. The case of U(H)str is shown in [GrMi-83]; 
for the translation of their result in terms of Definition 6.1; see, 
for example, [Pest-06, Section 2.2]. For Aut(Q) and H*({0,1]), see 
[Pest—98]. 

For the Urysohn space and its isometry group, see {Pest—06], in 
particular Theorem 5.3.10 (the isometry group of the Urysohn space, 
with its standard Polish topology, is a Levy group) and Theorem 4.1.3 
(every Levy group is extremely amenable). 
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For Aut(X, 2), see [GiPe-07, Theorem 4.2]. The case of L°(X, B, uw; G) 
is due to Eli Glasner [Glas—98] and Furstenberg—Weiss [unpublished]; see 
[Pest-06, Section 4.2]. 

For some of these groups, extreme amenability can also be shown by 
the arguments of [MeTs—13]. 


Remark 6.4. Propositions 5.2 and 6.3(1) show that an extremely 
amenable group need not be B-amenable. 

Concerning (1), recall on the one hand that the group U(H)str is 
known to have Kazhdan’s Property (T) [Bekk—03]. On the other hand, 
an amenable locally compact group which has Property (T) is compact. 

Concerning (2), note that not only Aut(Q) with the indicated topology 
is extremely amenable, but moreover every action by homeomorphisms 
of the group Aut(Q) with the discrete topology on a compact metrizable 
space has a fixed point [RoSo-07, Corollary 7]. 

Concerning (3), recall that Thompson’s group F is a dense subgroup 
of Ht ((0, 1]). The only consequence of (3) we can state is that is does 
not exclude that F is amenable. (This is a repetition of Remark 12 of 
[Pest—02].) 


For the next proposition, we use the following notation. 

Sym(N) is the symmetric group of N, with its standard Polish 
topology, as in Proposition 5.5. 

Let p be a positive number with 1 < p < oo and p F 2. Let ¢? be 
the Banach space of sequence (Zn)n>1 of complex numbers such that 
|zI| = (Set |2n|P)/? < oo. We denote by U(¢?) the group of linear 
isometries of £?, with the strong topology. 


Proposition 6.5. The following groups are NOT extremely amenable. 


(1) Any locally compact group G # {1}. 

(2) The symmetric group Sym(N). 

(3) The group H(C) of homeomorphisms of the Cantor space, with the 
compact-open topology. 

(4) The unitary group U(e”), for p with 1 < p < cw and p #2. 

(5) The group GL(V), for V ~ GF(q)™) as in Proposition 5.6. 


References for the Proof. Every locally compact group admits a free 
action on a suitable compact space [Veec-77]; and (1) follows. 

For (2) and Sym(N), see [Pest-98], or [Pest-06, Section 2.4]. As 
Glasner and Weiss have observed, the natural action of Sym(N) on the 
compact space LO Cc N? of all linear orders on N has no fixed point; 
details in [Pest-06, Example 2.4.6, page 47]. 

For (3), observe that the action of #(C) on C has no fixed point. For 
more on actions of this group on compact spaces, see [GlWe-03]. 

For (4), let us reproduce the argument of [Pest-06, Example 3.6.15]. 
Let (€n)neNn be the canonical basis of @?. For every sequence (tn)neN 
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in the compact group [[,en Tn, where each Ty, is a copy of T = 
{z €C | |z|=1}, and every o € Sym(N), there is an isometry of ¢? 
mapping én to tron) for all n € N. Moreover, every isometry of €? 
is of this form; the proof is like that of Banach, for the £?-space of real 
sequence and for t, € {—1, 1} for all n [Bana-32, Chap. XI, § 5, page 178). 
In other terms, the unitary group U(¢?) of €? is a semi-direct product 


UP) = (TI r) x Sym(N). 


neN 


Now Claim (4) follows from (2) and Proposition 6.2(4). 
For (5), see [Pest-06, Example 6.7.17]. 


To conclude this section, we quote a result that extends Proposition 
6.3(1). Compare with the way Remark 5.3(2) extends part of Proposition 
5.2. 


Proposition 6.6. A countably decomposable von Neumann algebra M 
is injective if and only if its unitary group U(M), with the weak topology, 
is the direct product of a compact group and an extremely amenable 
group. 

In particular, an infinite dimensional factor M is injective if and only 
if UCM) is extremely amenable, and the same holds for M a properly 
infinite von Neumann algebra. 


References for the proof:. [GiPe-07, Theorem 3.3] and [GiNg-13]. 


7. On the Definition of Ergodicity 


Let us agree on the following terminology. Consider a Borel action of a 
topological group G on a Borel space (X, 8). A Borel subset A C X is 
invariant by G if gA = A for allge G. 

Assume that we have moreover a G-invariant probability measure jz 
on (X,8). A Borel subset A Cc X is u-essentially invariant by G if 
u(gA A A) = 0 for all g € G (where A indicates a symmetric difference). 
For ‘u-essentially invariant’, Varadarajan uses ‘j-invariant’ [Vara—63, 
page 196], Maitra ‘u-almost invariant’ [Mait—77], and Phelps ‘invariant 
(mod yw)’ [Phel-66]. 


Definition 7.1. Let G be a topological group acting as above on a 
Borel space (X, 8) with a G-invariant probability measure jw. 

The action is w-ergodic if, for every invariant set A € JB, either 
u(A) = 0 or w(X ~\ A) = 0; in this situation, we say also that the 
invariant measure yz is w-ergodic. 

The action is s-ergodic if, for every 1-essentially invariant set A € B, 
either (A) = 0 or w(X \ A) = 0; in this situation, we say also that the 
invariant measure jt is s-ergodic. 
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Some authors use ‘ergodic’ for our ‘w-ergodic’ (see, for example, 
[Zimm-84, beginning of Chapter 2]), others use ‘ergodic’ for our 
‘s-ergodic’ (see, for example, [Phel-66, Chapter 10]). Proposition 7.7 
shows that, in a standard setting, the two notions coincide. 


Remark 7.2. Consider a Borel action of a topological group G on a Borel 
space (X,), and a G-invariant probability measure w on X. Assume 
that yz is s-ergodic. Then every Borel subset A in X such that w(A) = 1 
is u-essentially invariant by G. 

Indeed, let g € G. The subset g(A)\ (AN g(A)) is contained in 
X \A, hence is negligible for w. Similarly, AX (AN g(A)) = g(g- (AN 
(AN g \(A))) is negligible for uw, because uw is G-invariant. Hence 
(AAg(A)) = 0 for all g € G. 

Suppose moreover that X is a compact space, and that B is the o- 
algebra of Borel subsets of X, i.e. the o-algebra of subsets of X generated 
by the open subsets of X. The space P(X) of probability measures on X 
is a compact convex subspace of the dual space of C(X), with the weak- 
«-topology, and the space P(X) of G-invariant probability measures is 
a compact convex subspace of P(X). 


Definition 7.3. With the notation above, a G-invariant measure 
uw € P(X) is indecomposable if it is in the subset of extreme points 
E&(X) of P&(X). 

In other words, ~ is decomposable if there exist two distinct 
measures [11,2 € P(X) and a constant c with 0 < c < 1 such that 
m= ci + (1— c)po. 


The following proposition appears in [Bogo-39]. 


Proposition 7.4. Let G be a topological group acting continuously by 
homeomorphisms on a compact space X, and let uw be a G-invariant 
probability measure on X. The following two properties are equivalent: 


(i) we is indecomposable, 
(ii) pe is s-ergodic; 


and imply the following third property 
(itt) je is w-ergodic. 


Note. We will add another property equivalent to (i) and (ii) in 
Proposition 9.3. 

It is recalled below that (iii) does imply (ii) when G is second countable 
locally compact (Proposition 7.7), but not in general (Proposition 
8.1(3)). 


Proof. For (i) < (ii), we reproduce the proof of [Bogo-39]. This proof 
has remained the standard one: see, for example, [Walt-82, Theorem 
610]. Walters writes his proof for one continuous map T : X —> X, but 
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it works without change in the present situation. For (ii) => (i), see also 
[Phel-66, Proposition 10.4] (Proposition 12.4 of the Second Edition). 

(i) = (ii) We prove the contraposition. Assume that there exists a 
u-essentially G-invariant subset A of X with 0 < w(A) < 1. Then 
m= e(A)ui + (1 — (A) 2, with 


: BNA B : BO(X\A 
uta)“ ), p(B) = T- ma! (X ~ A)), 
for all Borel subsets B of X. 

(ii) => (i) Assume (again by contraposition) that there exist two dis- 
tinct G-invariant probability measures 41, 2 of which w = cy4+ coMe2 
is a convex combination. For every Borel subset B of X with uw(B)=0, 
we have “1(B)=0=,2(B), ie. 41, “2 are absolutely continuous with 
respect to 4. By the Radon—Nikodym theorem (one of many convenient 
references is [Rudi-66, Theorem 6.9]), there exist well-defined and unique 
functions f,, f2 € L'(X, 2), with non-negative values, such that w4 = fip 
and 2 = fou. By uniqueness, fi and f2 are G-invariant. Set 


A= {re X | fi(z) > f(a} and B= {xe X | fi <A}. 


H1(B) = 


Then A, B are p-essentially G-invariant Borel subsets of X of positive 
measure, and constitute a partition of X. 
The implication (ii) => (iii) is trivial. 


Definition 7.5. Let (Q,8) be a Borel space; assume that it is a 
standard Borel space, i.e. that there exist an isomorphism of (Q, B) 
with a Borel subset of a complete separable metric space. Let G be 
a topological group acting in a Borel way on (2,8); assume that there 
exists a probability measure yw on (Q, 8) which is a G-quasi-invariant, 
i.e. such that, for all A Cc B and g € G, we have w(A) = 0 if and only if 
U(gA) = 0. 

Let (Y,C) be another standard Borel space (the most important case 
here is Y = R, with the usual Borel o-algebra). A Borel function f : 
Q — Y is p-essentially G-invariant if, for each g € G, we have 
f(gw) = f(@) for pu-almost all w € Q; it is G-invariant if, for each 
g € G, we have f(gw) = f(@) for allwe Q. 


Lemma 7.6 (Lemma 3.3 in [Vara-63], or Lemma 2.2.16 in [Zimm-84]). 
Let G be a second countable locally compact group acting in a Borel way 
on a standard Borel space Q. Assume that Q has a G-quasi-invariant 
probability measure lL. 

Let Y be a standard Borel space and f : 82 —> Y be a Borel function. 
Assume that f is essentially G-invariant. Then there exists a Borel 
function f: Q — Y which is G-invariant, and f(@) = = f(@) for p- 
almost allw € Q. 

In particular, if A C Q is a w-essentially G-invariant Borel subset, 
there exists a G-invariant Borel set A C Q such that u(AA A) = 0. 
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On the Proof. Note that the particular case follows from the result on 
functions, with Y = R and f the characteristic function of A. 

It is crucial here that G is locally compact (hence G has a Haar 
measure) and ‘not too large’ (more precisely ‘second countable’ in 
[Zimm-84] and [Vara-63]), because the proof relies very strongly on 
Fubini’s theorem, and Fubini’s theorem holds for Haar measure with 
appropriate conditions only. 


Proposition 7.7. Let G be a second countable locally compact group 
acting in a Borel way on a standard Borel space Q. Assume that Q has 
a G-quasi-invariant probability measure [L. 

Then the action is s-ergodic if and only if it 1s w-ergodic. 


Proof. The proposition follows from the lemma and the definitions. 


We quote now the following decomposition theorem. It can be seen 
as an elaboration of results going back to von Neumann, in the early 
1930s. Two convenient references are [Vara—63], for invariant measures, 
and [GrSc-00], for quasi-invariant measures. 


Theorem 7.8 (Ergodic Decomposition Theorem). Let G be a second 
countable locally compact group acting in a Borel way on a standard 
Borel space (X,B); assume that P(X) is non-empty (equivalently that 
E(X)® is non-empty). Denote by (Y,C) the standard Borel space with 
Y = €(X)© andC its Borel o-algebra. 

Then there exists a family (py)yey of probability measures on Y , with 
the following properties: 


(1) for every Borel subset B in X, the map y +> py(B) from Y to 
[0, 1] is Borel; 

2) for every y € Y, the measure py is G-invariant and ergodic; 

y 9g 
3) for y,y' € Y with y # y’, the measures py and py are mutuall 
y y y 

singular; 

(4) for every uw € P(X)%, there exists a probability measure v on 
(Y,C) such that w(B) = fy py(B)dv(y) for every Be B. 


Note: A fortiori, the theorem holds for a continuous action of G on a 
metrizable compact space. 


Corollary 7.9. Let G be a second-countable locally compact group act- 
ing continuously by homeomorphisms on a metrizable compact space X . 
For every u € E%(X), there exists a Borel subset A, of X such that 


w(Ay) = 1 and p'(Ay) = 0 forall p'€ re, S) with pw! A p. 


On the Proof. This follows from a version of the Ergodic Decompo- 
sition Theorem more comprehensive than that of Theorem 7.8, see 
[GrSc-O0]. 
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Theorem 1.4 of [GrSc—00] is such a theorem for a countable discrete 
group, say I’. It is shown there that 


(a) the sub-o-algebra 7 of B consisting of ’-invariant sets is countably 
generated, say generated by a countable sub-algebra 7’; 

(b) for every x € X, the set [x]7 = () C, where the intersection is over 
all C € T’ with x € C, is a Borel subset of X (and it coincides 
with the intersection (| C over all C € T with z € C); 

(c) [a]7 € 7, ie. [g(x)]7 = [2]7 for all x € X and g ET; 

(d) there exist a T-invariant Borel subset Xo C X, with n(X \ Xo) = 0 
for all n € P'(X), and a surjective map p: Xy9 —> Yr, tH > pe, 
measurable with respect to J and C, such that 


Pa ([z]7) =1, 
Pa’ ([z]7) =0, 
for all 2,2’ € Xo with p(2’) S$ p(2). 


Then, for x € E!(X), it suffices to set: 
(e) Ay = [z]7, with « € Xo such that p(x) = uw. 


[Note that, in [GrSc—00], the set [z]7 is defined as an intersection over 
CeéeT with x € C; but [z]7 is Borel because this is also an intersection 
over C' in the countable subalgebra 7’. We are grateful to K. Schmidt 
for an e-mail clarifying this point for us.] 

In the more general case of a second-countable locally compact group 
G, we refer to [GrSc-00, Theorem 5.2]. Choose a countable dense 
subgroup I’ of G. Define Xp as above, in terms of I and, for x € Xo, 
define also p, and [z]7 in terms of I. Then we have: 


(f) [g(@)|7 = [2]7 for all g € G and x € X (compare with (c)); 

(g) EC(X) = E™(X); 

(h) pe € Y =E(X) (compare with (d)); 

(i) pe ([z]7) = 1 and pz ([z]7) = 0 for all z,2’ € Xo with p(a’) 4 
p(a) (as in (d)). 


[The sets [z]7 may depend on the choice of I, but we will not discuss 
this further here.| 

The article [GrSc—00] is written for the case of quasi-invariant mea- 
sures. For the particular case of invariant measures, as discussed in this 
article, we could equally have quoted an earlier article by Varadarajan. 
Specifically, we refer to [Vara—63, Theorem 4.2] for the fact that our Borel 
space (Y,C) is standard. Our sets A,, with w € E(X), correspond to 
the sets Xe, with e in the space J of ergodic G-invariant measures, in 
[Vara-63]. 


We end this section with a simpler version of the previous corollary, 
for comparison with Propositions 8.2 and 8.3 below. 
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Corollary 7.10. Let G be a second-countable locally compact group 
acting continuously by homeomorphisms on a metrizable compact space 
X. Assume that there exist two distinct ergodic probability measures 
pi, M2 € EO(X). 

Then there exists two G-invariant Borel subsets A,, Ag C X such that 


Mi(A1) = 1, 1 (A2) = 0 (9.1) 

u2(A1) = 0, w2(A2) = 1. 
Proof. This is a consequence of Theorem 7.8, since, on the one hand, 
two distinct measures in €@(X) are mutually singular, and, on the other 
hand, two measures 1 and j2 are mutually singular precisely when 
there exist two Borel subsets A1, Ag in X for which the equalities of 
(9.1) hold. 


8. Kolmogorov’s Example 


The example of this section is due to Kolmogorov. It appears in 
[Fomi-50]; it has been revisited in [Vara-63], with reference to 
Kolmogorov, and in [Mait—77], without. Note that Fomin was a student 
of Kolmogorov [Ale—76]. 

Consider a number p with 0 < p < 1 andthe measure A, on Y := {0,1} 
defined by Ap(0) = p and Ap(1) = 1 — p. For i € Z, let (Yj,Ap,i) be a 
copy of (Y,Ap). Let X = [jez Xi = {0, 1}4 be the product of the Y; ’s, 
and B the usual o-algebra; (XB) is a standard Borel space. Let up be 
the product probability measure [];-74p,i, which is called a Bernoulli 
measure. We denote by S : X —> X the corresponding Bernoulli shift, 
defined by (Sx); = 2j41 for all i ¢ Z; observe that the measure Lp is 
preserved by S. Recall that the transformation S' of (X, zp) is s-ergodic, 
indeed strongly mixing; see e.g. [Walt-82, Theorems 1.12 and 1.30]. 

Denote as in Proposition 5.5 by Sym(Z) the full symmetric group 
of Z, with its standard Polish topology, and by Sym,(Z) the subgroup 
of Sym(Z) of permutations with finite support. Recall that Sym ,(Z) is 
countable, locally finite, and dense in Sym(Z). The natural action of 
Sym(Z) on X is continuous, and preserves [Up. 

Denote as in Definition 7.3 by psy (X) the compact convex set of 
Sym, (Z)-invariant probability measures on X; it is a compact convex set 
in the dual space of C(X), with the weak-*-topology, and € eying cred ) is 
the set of its extreme points. Similarly for € Sym(Z) (X). Since Syms (Z) isa 
subgroup of Sym(Z), the space P°Y™® (X) is contained in PSY" (X). 
Since Symf(Z) is dense in Sym(Z), and the latter acts continuously on 
P(X), we have indeed 


psym(Z) (X) = pdsym;(Z) (X). 


AMENABILITY OF TOPOLOGICAL GROUPS 241 
In terms of extreme points of compact convex sets, we have consequently 
E°ym@ (xX) = eSymy@ (XxX), (9.2) 


Proposition 8.1. Consider as above the natural actions of the Polish 
group Sym(Z) and of its subgroup Symy(Z) on the compact space X = 
(0, 1)7. 


(1) The set E°Y"s®(X) coincides with the set {Lplo<p<i of Bernoulli 
measures. 

(2) Similarly, E89" (xX) = {Uplo<p<i; in particular, for every p € 
[0, 1], the Bernoulli measure wp is invariant and s-ergodic for the 
action of Sym(Z). 

(3) Consider an integer k > 1, a finite sequence (pj)j=1,..,4 with 
0 < py < -++ < pe < 1, positive constants ci,...,ck with 
cp t+--- +c, = 1, and the Sym(Z)-invariant probability measure 
[= ClMp, +-+- + ChUp, € Psym@ (X). Then LL is w-ergodic. 

[fk > 2, w is not s-ergodic. 


Proof. Claim (1) is known as a result of de Finetti. The original article 
seems to be [dFin-37], but we rather refer to [Fell-71, Section VII.4, 
pages 228-9]. 

Claim (2) follows by Equality (9.2). It is the way de Finetti’s result is 
quoted in [Fomi-—50, Section 2.4]. 

For (3), observe that the Sym(Z)-orbits on X are easily described: 


(a) two one-point orbits, one with 0 ’s only, the other with 1 ’s only, 

(b) for each k > 1 two countable infinite orbits {(vi)iez € X | 
ies Ti = k} and {((t)iez € X | Vez — %) = F}, 

(c) and the uncountable orbit, that we denote by X’, of sequences 
that have infinitely many 0’s and infinitely many 1 ’s. 


In particular, the complement N of X’ in X is countable, the partition 
X = X'UN is Sym(Z)-invariant, and Sym(Z) is transitive on X’. It 
follows that every Sym(Z)-invariant subset of X is either inside N or 
contains X’. 

Because N is countable and the measure y of (3) without atoms, 
U(N) = 0. Hence the action of Sym(Z) on (X, 2) is w-ergodic. 

If k > 2, the measure yu is by definition decomposable, i.e. not s-ergodic 
(Proposition 7.4). 


Proposition 8.2. Corollary 7.10 does not extend to the situation of 
the group Sym(Z) acting on the compact space X . 

More precisely, there exists a Sym(Z)-invariant Borel subset X' of X 
such that, for every p €]0, 1[, we have wp € E&9™(X) and wp(X’) = 1. 


Proof. Let A Cc X be a Sym(Z)-invariant Borel subset. Suppose that 
there exists some p €|0,1[ such that (A) 4 0. In the previous proof, 
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we have checked that X \ A is countable. Hence, for every p €]0, 1[, we 
have [tp(A) = 1. 


Despite the failure of Corollary 7.10 in situations such as that of the 
previous proposition, we have the following result, that is Theorem 6 of 
[Fomi-—50]: 


Proposition 8.3 (Fomin). Let G be a topological group acting con- 
tinuously by homeomorphisms on a compact space Q. Assume that there 
exist two distinct s-ergodic G-invariant probability measures on Q, say 
vy and v9. 

Then there exists a Borel partition of Q in two subsets Ay, Ag that 
are v1-essentially invariant and v2-essentially invariant by G, and such 
that 


(1) vy(A1) =1 and v9(Aj) = 9, 
(2) vy(A2) =0 and ve(Ag) = 1. 


When G is a second countable locally compact group, recall we gave a 
much stronger conclusion in Corollary 7.10; in particular, the conclusion 
of Proposition 8.3 holds with A;, Ag actually G-invariant, a condition 
stronger than the above conditions of essential invariance. 

Before giving an illustration of Proposition 8.3 with Kolmogorov’s 
example, we recall the following well-known fact. For p €]0,1[, let Ep 
denote the Borel subset of X consisting of sequences (2j);¢z such that 
limps 60 shot Lin-k Xj; = p, i.e. of sequences in which the 1 ’s have 
density p. Note that Ep is invariant by Sym,(Z), but not by Sym(Z). 


Proposition 8.4. Let p,q €|0,1| with p 4 q, and let Ey, Eg C X be 
as above. Then Up(Ep) = 1 and p(Eq) = 0. 


Proof. Let go : X — R be defined by go(x) = ap. Then 


i go(x)dupy(x) = p, 
X 


by definition of 4). By Birkhoff’s ergodicity theorem, the limit 


k 
1 
- = li Ss? 
Oe a ORT phe oy 


exist for p-almost all c € X, and defines a p-almost everywhere 
constant function gj of essential value p (because the shift S' is s-ergodic 
on (X,4p)). Observe that, for c € X, we have g(x) = p if and only 
if c € Ey. Hence Ey = X, up to py-negligible sets; otherwise said: 
itp(Bp) = 1. 

Since Ey M Eq = 0, we have p(Eq) = 0. 
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If we particularize the pair (G,Q) to the pair (G = Sym(Z),X = 
{0, 1}4) of Kolmogorov’s example, and v1, v2 to the Bernoulli measures 
41/3; 42/3, then the conclusion of Proposition 8.3 holds for the subsets 
1/3, £2/3 of Proposition 8.4; for the essential invariance of these two 
subsets, see Remark 7.2. 

Let us finally mention a generalization of Equality (9.2), from just 
before Proposition 8.1, and of Part (1) of the same proposition. Consider 
a measure space (Y,C) and the product space (X,B), with X = Y4, 
with the natural action of the groups Sym(Z) and Syms (Z). For every 
probability measure A on (Y,C), let x be the probability measure on 
(X,B) that is the product of copies of A indexed by Z; we denote by 
Bern(X) the set of measures of the form a; observe that Bern(X) C 
psym(Z)(X). Then: 


(1) P8ym@ x) = PSy™¢® (X) [HeSa-63, Theorem 3.2], and there- 
fore ESYMD (xX) = ESymyD xX), 
(2) €8¥"¢® (xX) = Bern(X) [HeSa-63, Theorem 5.3]. 


9. Fomin’s Representations 


In [Fomi-—50, § 1], Fomin proves the equivalence (i) = (ii) of Proposition 
7.4 by adding one more equivalent condition, of independent interest, 
in terms of unitary representations. The object of this section is to 
describe this condition. Interaction between ergodic theory and unitary 
representations were pointed out earlier by Koopman [Koop-31]. 

Let G be a topological group acting continuously by homeomorphisms 
on a compact space X. Let C(X,T) denote the group of all continuous 
functions from X to the compact group T of complex numbers of 
modulus one. We consider the natural action of G on C(X,T), defined 
by (g(y))(x) = v(g7'(z)) for all g € G, g € C(X,T7), and z € X. 
In reference to Fomin, we denote by ¥ the corresponding semi-direct 
product C(X,T) x G (Fomin’s notation is P). We do not furnish the 
group F with any topology. 

Consider a G-invariant probability measure uw on X, and the complex 
Hilbert space Ae pt). For (v, g) € F, define a linear operator py (9, 9) 
on L2,(X,w) by 


(pu (y, DE) (2) = g(a)é(g7*(a)) 


for all E € L2(X ,) and x € X. The following proposition, which is now 
straightforward to check, is Theorem 1 in [Fomi—50]. 


Proposition 9.1. Let G,X,F, and w be as above. Then (¢,g) > 
PulY~,g) defines a unitary representation of the group F on the Hilbert 
space I2, (X, WL). 
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For every essentially bounded function y € Le (X,), we denote by 
My the multiplication operator € +> g& on I, (X, WL). 


Lemma 9.2. With the notation above, let S be a continuous linear 
operator on TACX, pL) that commutes with pyu(~,g) for all (p, 9) € F. 

Then S = My for some w € L@(X,"); moreover y is w-essentially 
G-invariant (in the sense of Definition 7.5). 


Proof. By hypothesis, $ commutes with p,(g,1) = Mg, for every 
g €C(X,T), and therefore with sums of products of such multiplication 
operators. By the Stone—Weierstrass theorem, S commutes also with Mg 
for every continuous function g : X —> C. By a standard argument, it 
follows that there exists y € L@(X, ) such that S = My; see [BourTS, 
Chap. II, § 3, no 3, Lemma 3}. 

Given that S = My commutes with p, (1, g) for all g € G, the function 
w is u-essentially G-invariant. 


The following proposition is Theorem 2 in [Fomi-50]. Note that 
Property (ii) below coincides with Property (ii) of Proposition 7.4. 


Proposition 9.3. Let G,X,F, and w be as above. The following 
properties are equivalent: 


(i) the representation p, is irreducible, 
(ii) the measure jc is s-ergodic. 


Proof. For (i) = (ii), we prove the contraposition. If w is not s-ergodic, 
there exists a s-essentially invariant Borel subset A Cc X with 0 < 
f(A) < 1. The subspace of T2(X , 4) of functions which vanish outside 
A is invariant by p,, and therefore the representation is reducible. 

The converse implication (ii) = (i) follows from Lemma 9.2 and 
Schur’s lemma (for which we refer to [BeHV—08, Theorem A.2.2]). 
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Added in Proof 


We are grateful to Maxime Gheysens and Nicolas Monod for having 
brought to our attention the following facts and references. Compare the 
definition below to that of the fixed point property, 3.3. We use “compact 
convex set” for “non-empty compact convex subset of a locally convex 
topological real vector space”. 


Definition 1. Let G be a topological group. An action by homeomor- 
phisms of G on a topological space X has a continuous orbit if there 
exists a point xo € X such that the restricted action G x Gro > Gxo 
is continuous. 

The topological group G has the Day fixed point property if every 
action of G by continuous affine transformations on a compact convex 
set, with a continuous orbit, has a fixed point. 


The following fact is the particular case for topological groups of 
Theorem 4 of [Day—64], established there for topological semigroups. 


Proposition 2. A topological group G is B-amenable (Definition 3.8) 
if and only if it has the Day fixed point property. 


This provides for example an easy proof that the symmetric group 
Sym(N) is not B-amenable (Proposition 5.5). Indeed, consider the 
natural action of this group on the set Mean(N) of all means on N, 
that is a compact convex subset of the weak dual of the real Banach 
space €R(N). Observe that, since N is infinite, the action of Sym(N) 
on N is paradoxical in the sense of Tarski, so that there is not any 
Sym(N)-invariant mean on N, i.e., there is not any point in Mean(N) 
fixed by Sym(N). Denote by D the subset of Mean(N) consisting of the 
Dirac measures; it is a Sym(N)-orbit. The action of Sym(N) on D is 
continuous. It follows that Sym(N) is not B-amenable. 

In [Day—61, § 4], Day considers one more property: topological groups 
(or semi-groups) G for which every separately continuous affine action 
on a compact convex set has a fixed point. This is a property weaker 
than B-amenability, and stronger than amenability. 

Finally, let us mention that the fixed point property of Definition 3.3 
is equivalent for locally compact groups to a formally stronger property 
introduced in [Simo-72], that is a fixed point property for weakly 
measurable affine actions on compact convex sets. 
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Abstract 

There is a recently discovered connection between the spectral theory of 
Schrodinger operators whose potentials exhibit aperiodic order and that 
of Laplacians associated with actions of groups on regular rooted trees, 
as Grigorchuk’s group of intermediate growth. In this chapter we give 
an overview of corresponding results, such as different spectral types 
in the isotropic and anisotropic cases, including Cantor spectrum of 
Lebesgue measure zero and absence of eigenvalues. Moreover, we discuss 
the relevant background as well as the combinatorial and dynamical 
tools that allow one to establish the aforementioned connection. The 
main such tool is the subshift associated with a substitution over a finite 
alphabet that defines the group algebraically via a recursive presentation 
by generators and relators. 
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Introduction 


The study of spectra of graphs associated with finitely generated groups, 
such as Cayley graphs or Schreier graphs (natural analogues of Cayley 
graphs associated to not necessarily free group actions), has a long 
history and is related to many problems in modern mathematics. Still, 
very little is known about the dependence of the spectrum of the 
Laplacian on the choice of generators in the group and on the weight 
on these generators. In a recent paper [39] we addressed the issue of 
dependence on the weights on generators in the example of Grigorchuk’s 
group of intermediate growth. More specifically, we determined the 
spectral type of the Laplacian on the Schreier graphs describing the 
action of Grigorchuk’s group on the boundary of the infinite binary 
tree and showed that it is different in the isotropic and anisotropic cases. 
In fact, the spectrum is shown to be a Cantor set of Lebesgue measure 
zero in the anisotropic case, whereas, as has been known for a long time, 
it consists of one or two intervals in the isotropic case. Moreover, we 
showed almost surely (with respect to a natural measure on the boundary 
of the tree) the absence of eigenvalues for the Laplacians in question. 

Our investigation in [39] provides (and relies on) a surprising link 
between discrete Schrodinger operators with aperiodic order and the 
substitutional dynamics arising from a presentation of the group by 
generators and relators. 

The purpose of this chapter is twofold. First, we want to survey the 
spectral theoretic results of [39]. Second, we want to put these results 
in wider perspective by discussing the background on aperiodic order 
in dimension one. In this context we present a discussion of subshifts 
and aperiodic order in Section 1 and of Schrédinger operators arising in 
models of (dis)ordered solids in Section 2. 

We also continue our study of the substitution that is instrumental 
for the results in [39]. It first appeared in 1985 in the presentation of 
Grigorchuk’s group by generators and relators found by Lysenok [66] 
(such infinite recursive presentations are now called L-presentations). 
A remarkable fact is that the same substitution also describes basic 
dynamical properties of the group. Here we review its combinatorial 
properties and carry out a detailed study of the factor map to its maximal 
equicontinuous factor, which is the binary odometer. This factor map is 
proven to be one-to-one everywhere except on three orbits. This, in turn, 
can be linked to the phenomenon of pure point diffraction, which is at the 
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core of aperiodic order. All these discussions concerning the substitution 
are contained in Section 3. 

The necessary background from graphs and dynamical systems is 
discussed in Section 4 and basics on Grigorchuk’s group and its Schreier 
graphs can be found in Section 5. 

The connection between Schrédinger operators and Laplacians on 
Schreier graphs revealed in [39] (and reviewed in Sections 6 and 7) 
can also be used to show that the Kesten-von-Neumann trace and the 
integrated density of states agree. This seems to be somewhat folklore. 
We provide a proof in Section 8. 

Our approach can certainly be carried out for various further families 
of groups generalizing Grigorchuk’s group. In particular, our results 
fully extend to a larger family of self-similar groups acting on the 
infinite binary tree considered by Sunic in [76]. Moreover, each of 
these self-similar groups belongs to an uncountable family of groups 
parametrized by sequences in a certain finite alphabet, in the same way 
that Grigorchuk’s group belongs to the family of groups (Gw),<(0,1,2)N 
constructed by the first named author in [37]. The Schreier graphs of 
the groups in the same family look the same, but their labeling by 
generators, and thus their spectra, depend on the sequence w. The 
associated subshifts are also different; in particular, they don’t have to 
come from a substitution in the case when the corresponding group is 
not self-similar. This more general setup will be considered in a separate 
chapter. 
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1. Subshifts and Aperiodic Order in One Dimension 


Long-range aperiodic order (or aperiodic order for short) denotes an 
intermediate regime of order between periodicity and randomness. It 
has received a lot of attention over the last thirty years or so; see e.g. 
the article collections and monographs [5, 8, 53, 69, 73]. 

This interest in aperiodic order has various reasons. On the one hand, 
it is due to the many remarkable and previously unknown features 
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and phenomena arising from aperiodic order in various branches of 
mathematics. On the other hand, it is also due to the relevance of 
aperiodic order in physics and chemistry. Indeed, aperiodic order pro- 
vides a mathematical theory for the description of a new type of matter 
discovered in 1982 by Shechtman via diffraction experiments [74]. These 
experiments showed sharp peaks in the diffraction pattern indicating 
long-range order and at the same time a fivefold symmetry indicating 
absence of periodicity. The discovery of solids combining both long-range 
order with aperiodicity came as a complete surprise to physicists and 
chemists, and Shechtman was honored with a Nobel Prize in 2011. By 
now, the solids in question are known as quasicrystals. 

In one dimension aperiodic order is commonly modeled by subshifts 
of low complexity. In higher dimensions it is modeled by dynamical 
systems consisting of point sets with suitable regularity features (which 
are known as Delone dynamical systems). Here, we present the necessary 
notation in order to deal with the one-dimensional situation. When we 
speak about aperiodic order subsequently, this will always mean that we 
have a subshift with suitable minimality features at our disposal. 

Let a finite set A be given. We call A the alphabet and refer to its 
elements as letters. We will consider the set A* of finite words (including 
the empty word) over the alphabet A, viewed as a free monoid (with 
the multiplication given by concatenation of words and the empty word 
representing the identity). Elements of A* will often be written as 
W = W,...Wn with w; € A. The length of a word is the number 
of its letters. It will be denoted by |- |. The empty word has length 
zero. Then, AZ denotes the set of functions from Z to A. We think of 
the elements of A” as bi-infinite words over the alphabet A. The 
set AN denotes the set of functions from N to A. We think of its 
elements of one-sided infinite words over A. They will often be denoted 
by & = &1&... 

If v, w are finite words and w € AZ satisfies 


@]... Oly) =U and @_|wI41-..@0 = w 


we write 
O=...W|v... 


and say that | denotes the position of the origin. 

We equip A with the discrete topology and A? with the product 
topology. By the Tychonoff theorem, A” is then compact. In fact, it 
is homeomorphic to the Cantor set. A pair (2, T) is called a subshift 
over A if Q is a closed subset of A% which is invariant under the shift 
transformation 


T:A® — A“, (To)(n) := o(nt+ VD. 


If there exists a natural number N 4 0 with TN w = for all w € Q, 
then (2, T) is called periodic; otherwise it is called nonperiodic. 
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Whenever @ is a word over A (finite or infinite, indexed by N or by Z) 
we define 


Sub(w) := Finite subwords of w. 


By convention, the set of finite subwords includes the empty word. Every 
subshift (Q, 7) comes naturally with the set Sub(Q) of associated finite 
words given by 


Sub(Q) := U Sub(a). 


w@eEQ 


A word v € Sub(Q) is said to occur with bounded gaps if there exists an 
Ly > 0 such that every w € Sub(Q) with |w| > Ly contains a copy of v. 
As is well known (and not hard to see) (Q, T) is minimal if and only 
if every v € Sub(Q) occurs with bounded gaps. For proofs and further 
discussion we refer to standard textbooks such as [65, 81]. We will be 
concerned here with the following strengthening of the bounded gaps 
condition. It concerns the case that Ly can be chosen as C'|v| with fixed 
C’ (independent of v). 


Definition 1.1 (Linearly repetitive). A subshift (2,T) is called 
linearly repetitive (LR), if there exists a constant C' > 0 such that any 
word v € Sub(§2) occurs in any word w € Sub(2) of length at least 
Clo. 


Remark 1.2. This notion has been discussed under various names by 
various people. In particular it was studied by Durand, Host, and Skau 
[32] in the setting of subshifts (under the name “linearly recurrent” ). 
For Delone dynamical systems it was brought forward at about the 
same time by Lagarias and Pleasants [56] under the name “linearly 
repetitive.” It has also featured in the work of Boris Solomyak [75] (under 
the name “uniformly repetitive”). It was also already discussed in an 
unpublished work of Boshernitzan in the 1990s. That work also contains 
a characterization in terms of positivity of weights. A corresponding 
result for Delone systems was recently given in [14]. 


Durand [30] gives a characterization of such subshifts in terms of 
primitive S-adic systems and shows the following (which was already 
known to Boshernitzan). 


Theorem 1.3. Let (2, T) be a linearly repetitive subshift. Then, the 
subshift is uniquely ergodic. 


Remark 1.4. In fact, linear repetitivity implies a strong form of 
subadditive ergodic theorem [61]. Validity of such a result together with 
the fundamental work of Kotani [55] allows to prove the Cantor spectrum 
of Lebesgue measure zero, see [59]. Our proof uses an extension of this 
approach worked out in [12]. 
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2. Schrddinger Operators with Aperiodic Order 


In this section, we present (parts of) the spectral theory of discrete 
Schrodinger operators associated to minimal dynamical systems. 

Schrédinger operators occupy a prominent position in the theory 
of aperiodic order. Indeed, they arise in the quantum mechanical 
description of conductance properties of quasicrystals and exhibit quite 
interesting mathematical properties. In fact, already the first two papers 
on them written by physicists suggest that the corresponding spectral 
measures are purely singular continuous and the spectrum is a Cantor 
set of Lebesgue measure zero [54, 71]. By now these features, as well 
as other conductance-related properties known as anomalous transport, 
have been thoroughly studied in a variety of models by various authors; 
see the survey articles [21, 22, 77]. The phenomenon that the underlying 
spectrum is a Cantor set of Lebesgue measure zero is usually referred 
to as Cantor spectrum of Lebesgue measure zero and this is how we will 
refer to it subsequently. 

The Laplacians on Schreier graphs discussed later will turn out to be 
unitarily equivalent to certain such Schrédinger operators. 

In this section, we first discuss basic mathematical features of 
Schrodinger operators associated to dynamical systems in Section 2.1, 
then turn to absolutely continuous spectrum and the spectrum as a 
set for subshifts in Section 2.2, and finally give some background from 
physics in Section 2.3. 


2.1. Constancy of the Spectrum and the Integrated 
Density of States (IDS) 


In this section we review some basic theory of discrete Schrodinger oper- 
ators (or rather Jacobi matrices) associated with minimal topological 
dynamical systems. This framework is slightly more general than the 
framework of minimal subshifts. The results we discuss include constancy 
of the spectrum and uniform convergence of the so-called integrated 
density of states. All of these results are well known. 

Whenever T is a homeomorphism of the compact space 2 we will refer 
to (2, T) as a topological dynamical system. Later we will meet an even 
more general definition of a dynamical system. To continuous functions 
fg: 82 — R we then associate a family of discrete operators (Hy)wea. 
Specifically, for each w € 2, Hy, is a bounded self-adjoint operator from 
?(Z) to €?(Z) acting via 


(Hyu)(n) = f(TPa)u(n — 1) + f(T"**o)u(n + D+ g( To) u(n) 


for u € €2(Z) and ne Z. 
In the case f = 1 the above operators are known as discrete 
Schrodinger operators. For general f the name Jacobi matrices is often 
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used in the literature. Here, we will deal with the case f € 1 but we will 
primarily refer to the arising operators as Schrodinger operators. 

As the operator H, is self-adjoint, the operator H,, — z is bijective 
with continuous inverse (H, — z)~! for any z € C\R. Moreover, for 
any y € €?(Z) there exists a unique positive Borel measure yw’ on R 
with 


1 
/ 7a dha) = (9, (Hy — 2)~19) 
RU—-2Z 


for any z € C\R. This measure is finite and assigns the value |p|? to 
the set R. 

For fixed w € 2, the measures 1%, gy € €2(Z) are called the spectral 
measures of H,,. The spectrum of H,, is defined as 


o(Hy) := {z € C: (Hy — zl) lacks a bounded two-sided inverse.} 


It is the smallest set containing the support of any yw (see e.g. [82]). 
The spectrum is said to be purely absolutely continuous if the spectral 
measures for all y € ?(Z) are absolutely continuous with respect to 
Lebesgue measure. The spectrum is said to be purely singular continuous 
if all spectral measures are both continuous (i.e. do not have discrete 
parts) and singular with respect to Lebesgue measure. 

The following result is well known. It can be found in various places; 
see e.g. [60]. Recall that (92, T) is called minimal if {T"w : n € Z} is 
dense in 92 for any w € £2. 


Theorem 2.1 (Constancy of the spectrum). Let (@,T) be minimal 
and f,g : 82 —> R continuous. Then, there exists a closed subset & CR 
such that the spectrum o(H,) of Hy equals X for any we Q. 


We will refer to the set © in the previous theorem as the spectrum of 
the Schrodinger operator associated to (2, T) (and (f,g)). The spectrum 
= is one of the main objects of interest in our study. 

Before turning to a finer analysis of the spectrum we will introduce a 
further quantity of interest to the so-called integrated density of states. 
In order to do so, we will assume that the underlying dynamical system 
(2,7) is not only minimal but also uniquely ergodic i.e. possesses a 
unique invariant probability measure, which we call 4. Then, we can 
associate to the family (H,) the positive measure k on R defined via 


[ Femaney = f rcte)80,50) 000) 
R 2 
(for F any continuous function on R with compact support). Here, 


d9 € £2(Z) is just the characteristic function of 0 € Z. This measure 
k is called the integrated density of states. Let 
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N :R —> [0,1], N(Z) =) dk, 
(-00, E| 


be the distribution function of k. 


There is a direct relation between the measure k and the spectrum of 
the H,. 


Theorem 2.2. Let (92, T) be minimal and uniquely ergodic. Then, the 
set & is the support of the measure k. If the function f does not vanish 
anywhere, then k does not have atoms (i.e. it assigns the value zero to 
any set containing only one element). 


Remark 2.3. This is rather standard in the theory of random operators. 
Specific variants of it can be found in many places. In particular, the first 
statement of the theorem can be found in [60]. In the case of f, which 
does not vanish anywhere, the statements of the theorem are contained 
in Section 5 of [78]. Given the constancy of the spectrum, Theorem 2.1, 
the statements are also very special cases of the results of Section 5 of 
[63]. The key ingredient for the absence of atoms is amenability of the 
underlying group Z. The statement on the support does not even need 
this property. 

As is well known, it is possible to “calculate” k via an approximation 
procedure. This will be discussed next. For € Z let 


jn 27((1,..., 2) —> C7) 
be the canonical inclusion and let p, be the adjoint of j,. Thus, 
pn : €7(Z) —> €({1,..., n}) 
is the canonical projection. Define for w € 2 then 
H? := pn Hojn- 


We will be concerned with the spectral theory of these operators. Let 
the measure k/” on R be defined as 


n 


i 
[ F@age on YFP) 8K, 5k) 


k=1 
(for any continuous F on R with compact support) and let 
N®:R— [0,1], N2(B) =] dk” (x), 
(co, B 


be its distribution function. Let Fj,...,£, be the eigenvalues of 
#H” counted with multiplicity. Then, straightforward linear algebra 
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(diagonalization of H/’ and independence of the trace of the chosen 
orthonormal basis) shows that 


a 1 
[ P(a)dki (x) = — 3 F(E}) 


holds for any continuous F on R with compact support and that the 
distribution functions of the measures k;” are given by 


Ei ] f H” not ding E 
N®(B) = # {Eigenvalues of H/” not exceeding } 


n 


where f denotes the cardinality of a set. In this sense, k/”’ is just an 
averaged eigenvalue counting. 


Theorem 2.4 (Convergence of the integrated density of states). Let 
(2, T) be minimal and uniquely ergodic. Then, for any continuous F on 
R with compact support and any € > 0 there exists an N EN with 


<E 


if P(aydb(a) — [ F(x) dk,’ (2) 
R R 


for allwe 2 andallneZ withn>=N. 


Proof. It is well known that the measures (k/’),, converge weakly toward 
k for n — oo for almost every w € 9. This can be found in many 
places, see e.g. Lemma 5.12 in [78]. (That lemma assumes that f does 
not vanish anywhere, but its proof does not use this assumption.) A 
key step in the proof is the use of the Birkhoff ergodic theorem. The 
desired statement now follows by replacing the Birkhoff ergodic theorem 
with the uniform ergodic theorem (Oxtoby theorem) valid for uniquely 
ergodic systems [81]. 


The operators H;” are sometimes thought of as arising out of the H, 
by some form of “Dirichlet boundary condition.” The previous result 
is stable under taking different “boundary conditions.” In fact, even a 
more general statement is true as we will discuss next. (The more general 
statement will even save us from saying what we mean by boundary 
condition.) Let for any n € Z and w € 2 be a self-adjoint operator C? 
on £7({1,...,n}) be given. Then, the statement of the theorem essentially 
continues to hold if the operators H,” are replaced by the operators 


H? = AP + CP 


provided the rank of the C’s is not too big. Here, the rank of an operator 
C on a finite dimensional space, denoted by rk(C), is just the dimension 
of the range of C’. 
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In order to be more specific, we introduce the measure kn on R 
defined as 


n 


es 1 ~ 
[ Fede = ms S( FAR) 8x: 5k) 


k=1 


(for any continuous F on R with compact support) and its distribution 
function given by 


a me 1 a) 
NP :R— [0,1], NO (4+) = ;, ttHigenvalues of H” not exceeding EF}. 


Corollary 2.5. Consider the situation just described. Let w € S82 be 
given with 


1 
—rk(CP) > 0,n > 0. 
n 


Then, for any continuous F on R with compact support and any € > 0 
there exists an N EN with 


<E 


| / F(x) dk(«) — / F(a) dk” (a) 
R R 


for alln => N. 


Proof. A consequence of the min-max principle, see e.g. Theorem 4.3.6 
in [51], shows 


[N"(E) — N"(B)| < *1k(C?) 
n 


independent of E € R. This directly gives the desired statement. 


The theorem allows one to obtain an inclusion formula for the 
spectrum &. Denote the spectrum of the operator A by a By 
construction, ae is just the support of the measure Re. 


Corollary 2.6. Assume the situation of the previous theorem. Then, 


the inclusion 
scQ\Ua 


n k>n 


holds for allw € Q. 


The corollary is somewhat unsatisfactory in that it only gives an 
inclusion. In certain cases more is known. This is further discussed in 
Section 8.1. For a general result on how to construct approximations 
whose spectra converge with respect to the Hausdorff distance we refer 
the reader to [13]. 
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2.2. The Spectrum as a Set and the Absolute Continuity of 
Spectral Measures 


In this section we consider Schrédinger operators associated to locally 
constant functions on minimal subshifts. Here, a function h on a subshift 
2 over a finite alphabet is locally constant if there exists an N > 0 such 
that the value of h(w) depends only on the word w(—N)...@(N). A key 
distinction in our considerations will then be whether (f, g) is periodic 
or not. 

The overall structure of the spectrum in the periodic case is well- 
known. This can be found in many references; see e.g. the mono- 
graph [78]. 


Theorem 2.7 (Periodic case). Let (2,T) be a minimal subshift and 
fig: 8 — R locally constant with f(w) 4 0 for allw € Q. If (f, 9) is 
periodic (with period N), then the spectra X of the associated Schrodinger 
operators consist of finitely many (and not more than N) closed intervals 
of positive length and all spectral measures are absolutely continuous with 
respect to Lebesgue measure. 


Remark 2.8. Note that periodicity of (f, g) may have its origin in both 
properties of (82, T) and properties of (f,g). For example periodicity 
always occurs if f = g = 1 irrespective of the nature of (@, T). Also, 
periodicity occurs for arbitrary f, g if (2, T) is periodic (i.e. there exists 
a natural number N 4 0 with T’ @ = for all w € &). 


The previous theorem gives rather complete information on © in the 
periodic case. In order to deal with the nonperiodic case, we will need a 
further assumption on (2, T). This condition is linear repetitivity. 


Theorem 2.9 (Aperiodic case [12]). Let (8, T) be a linearly repetitive 
subshift and f,g : 2 —> R locally constant with f(w) 40 for allw e€ Q. 
If (f,9) is nonperiodic, then there exists a Cantor set & of Lebesgue 
measure zero in R such that 


o(H,) = = 
for allae 2. 


Remark 2.10. The above theorem was first proven in [59] in the case 
f = 1. This result was then extended in [25] from linearly repetitive 
subshifts to arbitrary subshifts satisfying a certain condition known as 
Boshernitzan condition (B) (again for the case f = 1). In the form stated 
above it can be inferred from the recent work [12], Corollary 4. This 
corollary treats the even more general situation, where condition (B) is 
satisfied. Condition (B) was introduced by Boshernitzan as a sufficient 
condition for unique ergodicity [16] (see [25] for an alternative approach 
as well). In our context, we do not actually need its definition here. It 
suffices to know that linear repetitivity implies (B) (see e.g. [25]). 
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The previous result deals with the appearance of as a set. It also 
gives some information on the spectral type. 


Corollary 2.11. Assume the situation of the previous theorem. Then, 
no spectral measure can be absolutely continuous with respect to the 
Lebesgue measure. 


2.3. Aperiodic Order and Discrete Random 
Schrodinger Operators 


Schrodinger operators with aperiodic order can be considered within 
the context of random Schrédinger operators. Indeed, they arise in 
the quantum mechanical treatment of solids. As this may be revealing 
we briefly present this context in this section. Further discussion and 
references can be found e.g. in the textbooks [18, 19]. 

Consider a subshift (2, T’) over the finite alphabet A. Assume without 
loss of generality that A is a subset of the real numbers. Let a T-invariant 
probability measure yw on 2 be given. To these data we can associate the 
family (Hy)wee of bounded self-adjoint operators on 02(Z) acting via 


(Hou)(n) = u(n +1) + u(n — 1) + @(n)u(n). 


Such operators are (slightly special) cases of the operators considered in 
the previous section. They arise in the quantum mechanical treatment 
of disordered solids. The solid in question is modeled by the sequence 
ow € 2. The operator H, then describes the behavior of one electron 
under the influence of this w. More specifically, if the state of the electron 
is up € €7(Z) at time t = 0 then the time evolution is governed by the 
Schrodinger equation 


(0, u)(t) = —1H, u(t), u(0) = uo. 
This equation has a unique solution given by 
u(t) = eH yg, 


The behavior of this solution is then linked to the spectral properties 
of Hy. 

The two basic pieces of “philosophy” underlying the investigations are 
now the following: 


e Increasing regularity of the spectral measures increases the conduc- 
tance properties of the solid in question. 

e The more disordered the subshift is, the more singular the spectral 
measures are. 


Of course, this has to be taken with (more than) a grain of salt. In 
particular, precise meaning has to be given to what is meant by regularity 
and singularity of the spectral measures and conductance properties and 
disorder in the subshift. A large part of the theory is then devoted 
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to giving precise sense to these concepts and then prove (or disprove) 
specific formulations of the mentioned two pieces of philosophy. 

Regarding the first point of the philosophy we mention [48-50, 57] as 
basic references for proofs of lower bounds on transport via quantitative 
continuity of the spectral measures. 

As for the second point of the philosophy, the two, in some sense, most 
extreme cases are given by periodic subshifts representing the maximally 
ordered case on the one hand and the Bernoulli subshift (with uniform 
measure) on the other hand representing the maximally disordered case. 

The periodic situation can be thought of as one with maximal order. As 
discussed in the previous section the spectral measures are all absolutely 
continuous (hence not at all singular) and the spectrum consists of 
nontrivial intervals. These intervals are known in the physics literature 
as (conductance) bands. 

The Bernoulli subshift can be thought of as having maximal disorder. 
In this case all spectral measures turn out to be pure point measures 
and the spectrum is pure point spectrum with the eigenvalues densely 
filling suitable intervals. We refer to the monographs [18, 19] for details 
and further references. 

The subshifts considered in the previous section are characterized by 
some intermediate form of disorder. They are not periodic. However, they 
are still minimal and uniquely ergodic and have very low complexity. So 
they are close to the periodic situation (or rather well approximable 
by periodic models with bigger and bigger periods). Accordingly, one 
can expect the following spectral features of the associated Schrodinger 
operators: 


e Absence of absolutely continuous spectral measures (due to the 
presence of disorder i.e. the lack of periodicity). 

e Absence of point spectrum (due to the closeness to the periodic 
case). 

e Cantor spectrum of Lebesgue measure zero (as a consequence of 
approximation by periodic models with bigger and bigger periods 
and, hence, more and more gaps). 


Indeed, a large part of the theory for Schrédinger operators with ape- 
riodic order is devoted to proving these features (as well as more subtle 
properties) for specific classes of models. Further details and references 
can be found in the surveys [21, 22]. Here, we emphasize that also our 
discussion of the operators associated to a certain substitution generated 
subshift below will be focused on establishing the above features. 


3. The Substitution 1, its Finite Words Sub, and its 
Subshift (@2,, T) 


In this section we study the two-sided subshift (,, 7) induced by a 
particular substitution t on A= {a, 2, y, z} with 
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Cyan, ty =), A= 4 OSs. 


The one-sided subshift induced by this substitution had already been 
studied by Vorobets [79]. Some of our results can be seen as providing 
the two-sided counterparts to his investigations. The key ingredient in 
the investigations of [79] is that the arising one-sided sequences can 
be considered as Toeplitz sequences. This is equally true in our case 
of two-sided sequences. Thus, it seems very likely that one could also 
base our analysis of the corresponding features on the connection to 
Toeplitz sequenes. Here, we will present a different approach based on the 
n-decomposition and n-partition introduced in [39]. 

The subshift (2;, 7) will be of crucial importance for us as it will 
turn out that the Schrédinger operators associated to it are unitarily 
equivalent to the Laplacians on the Schreier graphs of the Grigorchuk’s 
group G. 

While we do not use it in the sequel, we would like to highlight that the 
substitution in question has appeared earlier in the study of Grigorchuk’s 
group G. Indeed, it is (a version of) the substitution used by Lysenok 
[66] for getting a presentation of Grigorchuk’s group G. More specifically, 
[66] gives that 


G=(a,b,c,d|l=a? ==? = d? = bed =x*((ad)*) 
= «*((adacac)*), k = 0,1,2,...), 
where « is the substitution on {a, b,c, d} obtained from t by replacing 
x by c, y by 6, and z by d. 
3.1. The Substitution t and Its Subshift: Basic Features 


Let the alphabet A = {a, x,y,z} be given and let t be the substitution 
mentioned above mapping ah ara, TR y, yR 2, zh &. Let Sub; 
be the associated set of words given by 


Sub; = U Sub(t”(w)). 
weA,neNu{o} 


Then, the following three properties obviously hold true: 


e The letter a is a prefix of t"(a) for any n € NU {0}. 
e The lengths |t”(a)| converge to co for n > oo. 
e Any letter of A occurs in t"(a) for some n. 


By the first two properties t”(a) is a prefix of t’*+!(a) for any n € 
N U {0}. Thus, there exists a unique one-sided infinite word 7 such that 
t”(a) is a prefix of n for any n € NU {0}. This 7 is then a fixed point of 
T, ie. T(y) = n. We will refer to it as the fixed point of the substitution T. 
Clearly, 7 is then a fixed point of tT” as well for any natural number n. 
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By the third property we then have 
Sub; = Sub(n). 
We can now associate to t the subshift 
Q,:={we AZ : Sub(@) C Subz}. 


Note that every other letter of 7 is an a (as can easily be seen). Thus, 
a occurs in 7 with bounded gaps. This implies that any word of Sub; 
occurs with bounded gaps (as the word is a subword of t"(a) and 7 is a 
fixed point of tT”). For this reason (2;, T) is minimal and Sub(w) = Sub, 
holds for any w € 2,;. We can then apply Theorem 1 of [26] to obtain 
the following. 


Theorem 3.1. The subshift (Q,, T) is linearly repetitive. In particular, 
(92,, T) is uniquely ergodic and minimal. 


Remark 3.2. It is well known that subshifts associated to primitive 
substitutions are linearly repetitive (see e.g. [27, 32]). Theorem 1 of [26] 
shows that linear repetitivity in fact holds for subshifts associated to 
any substitution provided minimality holds. Unique ergodicity is then a 
direct consequence of linear repetitivity due to Theorem 1.3. 


Our further considerations will be based on a more careful study of 
the t”(a). We set 


p© := a and p™ :=1"(a) for nEN. 
A direct calculation gives 
pr) = h(a) = t"(aea) = t(a)t"(a)t"(a) = pr" (a)p™. 


Thus, the following recursion formula for the p™ 


(RF) path _ p™ sn p™ 
with 
xz: n=k,k e NU {0} 
BS, =t™(“=4 y : n=3k+1,k eNU {0} 
z : n=3k+2,k eNU {0} 
is valid. 


This recursion formula is a very powerful tool. This will become clear 
in the subsequent sections. Here we first note that it implies 


jp | = arth — 1 


for all n > 0. We will now use it to present a formula for the occurrences 
of the xz, y, z in 7 and to introduce some special elements in Q,. 
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Proposition 3.3 (Positions of a,z,y,z in 7). Consider the fixed point 
n=n Nn... of t on A¥. Then the following holds. 


e The letter a occurs exactly at the positions 1+ 2k, k eNU {0} (ze. 
at the odd positions). 

e The letter x occurs exactly at the positions of the form 23"+1 +k. 
g3n+2 nk € NU{O} (i.e. at the positions of the form 23"+!.m with 
m an odd integer and n € NU {0} arbitrary). 

e The letter y occurs exactly at the positions of the form 23"+? +k. 
23743 nk € NU{0} (i.e. at the positions of the form 23"+?.m with 
m an odd integer and n € NU {0} arbitrary). 

e The letter z occurs exactly at the positions of the form 23"+3 + k. 
23n+4 nk € NU{O} (i.e. at the positions of the form 23"+3.m with 
m an odd integer and n € NU {0} arbitrary). 


Proof. We first note that the given sets of positions are pairwise disjoint 
and cover N. Thus, it suffices to show that the mentioned letters occur 
at these positions. 

The statement for a is clear. The statements for x,y,z can all be 
proven similarly. Thus, we only discuss the statement for 7. Repeated 
application of (RF) shows that 


n= pBPED py Bt) py 7 Bt) py _ 


with rj, 72,... € {x, y, 2}. Moreover, (RF) implies 


(BND = BM) py (Bn), 


Pp vp 


Combining these formula we see that x must occur at all positions of the 
form 


|p| Aq ke k(jporrh | He 1) = g3ntl 4+k- g3nt2. 


This finishes the proof. 


Remark 3.4. The previous proposition shows that n is a Toeplitz 
sequence (with periods of the form 2! for 1 € N). As mentioned already 
the analysis of the one-sided subshift in [79] is based on this property. 


We now head further to use (RF) to introduce some special two-sided 
sequences. As is not hard to see from (RF), for any n € NU {0} and any 
single letter s € {x,y,z} the word p™ sp™ occurs in n. Thus, for any 
s € {z, y, z} there exists a unique element w*) € Q, such that 


wo) =... p\™slp™ |. 


holds for all natural numbers n, where the | denotes the position of 
the origin. The elements wo ,o@™,@@ € 2, will play an important 
role in our subsequent analysis. They clearly have the property that 
they agree on N with n. Indeed, they can be shown to be exactly those 
elements of 2; which agree with n on N (see below). Here, we already 
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note that these three sequences are different. Thus, 2; contains different 
sequences, which agree on N. Hence, £2 is not periodic. 

We finish this section by noting a certain reflection invariance of our 
system. Recall that a nonempty word w = wy)... wn € A* with wj ¢ A 
is called a palindrome if w = wy... wi. An easy induction using (RF) 
shows that for any n € NU {0} the word p™ is a palindrome. It starts 
and ends with p“) for any k € NU {0} with k < n. As each p™ is a 
palindrome and any word belonging to Sub, is a subword of some p‘” 
we immediately infer that Sub; is closed under reflections in the sense 
that the following proposition holds. 


Proposition 3.5. Whenever w = w,... Wn € A* with w; € A belongs 
to Sub, then so does W := Wn... WI. 


3.2. The Main Ingredient for Our Further Analysis: 
n-Partition and n-Decomposition 


As a direct consequence of the definitions we obtain that for any n € 
N U {0} the word 7 has a (unique) decomposition as 


) ri) (n) ri Sas 


n=p D 


with i € {x,y,z}. Clearly, this decomposition can be thought of as a 
way of writing 7 with “letters” from the alphabet A, = {t"(a),t” (a), 


T™(y),T(z)} = {p™, 2, y,z}. Moreover, setting rj := a? we have 
oe = t"(rj) for any 7 € N. This way of writing 7 will be called the 


n-decomposition of n. It turns out that an analogous decomposition can 
actually be given for any element w € §2,. This will be discussed in this 
section. 

Specifically, we will discuss next that each wm € §2; admits for each 
n € NU {0} a unique decomposition of the form 


o=...p™sop™ sip™ so... 


with 


e s, € {x,y,z} for all k € Z, 
e the origin wp belongs to sop™. 


Such a decomposition will be referred to as n-decomposition of w. 
A short moment’s thought reveals that if such a decomposition exists 
at all, then it is uniquely determined by the position of any of the s;’s 
in w. Moreover, the positions of the s;’s are given by p + 2°+17, with 
p € {0,...,2"+! — 1}. Thus, the positions are given by an element of 
Z/2"*1Z. This suggests the following definition. 


Definition 3.6 (n-partition). For n € NU {0} we call an element 
P €Z/2"*!Z an n-partition of w € Q, if for any q € P both 
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© Wg € {x,y,z} and 
@ Wgqt+l1--:- Wgq4+Qrtl_] =p 
hold. 


(n) 


Clearly, for each w € Q,, existence (uniqueness) of an n-partition 
is equivalent to existence (uniqueness) of an n-decomposition. In this 
sense these two concepts are equivalent. It is not apparent that such an 
n-partition exists at all. Here is our corresponding result. 


Theorem 3.7 (Existence and Uniqueness of n-partitions [39]). Let 
n € NU {0} be given. Then any wo € 2, admits a unique n-partition 
P™(@) and the map 


PY. Os ZZ, is PO (@), 
is continuous and equivariant (i.e. P™ (Tw) = P™ (@) +1). 


Based on n-partitions (and n-decompositions) and the previous 
theorem one can then study the dynamical system (2,, T) as well as 
various questions on the structure of Sub,. This is the content of the 
next sections. 


3.3. The Maximal Equicontinuous Factor of the Dynamical 
System (2,, T) 


In this section we use n-partitions to study the structure of the dynamical 
system (92,, T). 

For any n € N we can consider the cyclic group J™ := Z/2"Z 
together with the map A™, called addition map, which sends m+2"Z to 
m+1+2"Z. Then, (J, A) is a periodic minimal dynamical system. 
Moreover, there are natural maps 


eer’ —s I on OS a OZ, 


for any n € NU {0}. These maps allow one to construct the topological 
abelian group Z as the inverse limit of the system (J+) ,2,), n EN. 
Specifically, the elements of Z2 are sequences (mn) with mn € J™ and 
Tn(Mn+1) = Mn for all n € N. This group is called the group of dyadic 
integers. The addition maps A™ are compatible with the inverse limit 
and lift to a map Ag on Ze (which is just addition by 1 on each member 
of the sequence in question). In this way, we obtain a dynamical system 
(Zz, Ag). It is known as the binary odometer. As Ag is just addition one 
can think of this system as an “adding machine.” It is well known that 
this dynamical system is minimal. 

Now Theorem 3.7 can be rephrased as saying that the dynamical 
system (J+), A(™+)) is a factor of (Q;, T) via the factor map P™. 
Clearly, the factor maps P“ are compatible with the natural canonical 
projections zp, in the sense that 


Kyo PO = po) 
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holds for all n > 1. Thus, we can “combine” the P™ for all n € N to 
get a factor map 


Pp : (27, T) — (Zo, A2), oH (NENK P™YV@)). 


We first use this to study continuous eigenvalues. Let (Y,R) be a 
dynamical system (i.e. Y is a compact space and R is a homeomorphism). 
Denote the unit circle in C by S!. Then, k € C is called a continuous 
eigenvalue of the dynamical system (Y, R) if there exists a continuous 
not everywhere vanishing function f with values in C on Y satisfying 


f(R(y)) = kf) 


for all y € Y. Such a function is called a continuous eigenfunction. 
Then, any continuous eigenvalue belongs to S! and the continuous 
eigenvalues form a group under multiplication whenever the underlying 
dynamical system is minimal. Indeed, by minimality any continuous 
eigenfunction has constant (nonvanishing) modulus. Then, the product 
of two eigenfunctions is an eigenfunction to the product of the cor- 
responding eigenvalues. The complex conjugate of an eigenfunction is 
an eigenfunction to the inverse of the corresponding eigenvalue and the 
constant function is an eigenfunction to the eigenvalue 1. Moreover, it 
is not hard to see that minimality implies that the multiplicity of each 
continuous eigenvalue is one (i.e. for any two continuous eigenfunctions 
f,g to the same eigenvalue there exists a complex number c with f = cq). 

Let now En be the group of continuous eigenvalues of J. This is just 
the subgroup of S! given by {2% iar :0<k < 2”—1}. Then, clearly the 
groups €y and J™ are dual to each other via 


En x IM —> St (z,m+2"Z) BH 2™. 


The dual maps to the canonical projections mz, : J) —+ J™ are 
then the canonical embeddings 


ln En — Ensqi1, 2b Z. 


Thus, En is a subgroup of E€n41. Let E be the group arising as the union 
of the En, i.e. 
E:= U Eyes 
n 
This group is often denoted as Z(2™). Equip it with the discrete topology 


and denote its Pontryagin dual € by J. 


Proposition 3.8. The group J is canonically isomorphic to the group 
Z2 via 


Zo — J, (mn +2"Z)n Pe (Zr 2", whenever z € E belongs to En). 
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Proof. By construction Z2 comes about as the inverse limit of the 
(I+) zy), n > 1. Then, the dual group of Ze arises as the direct limit 
of the dual system (Ey, tn). This limit is just the group €. Dualizing once 
more we find that J is indeed canonically isomorphic to the dual of Za. 
To obtain the actual formula we can proceed as follows: 

Define €7, := e2t ign Then, each €7, is a complex primitive 2th root 
of 1 with ore = €n. Now, consider an arbitrary element y € J i.e. a 
character y : € —> S!. Then, y is completely determined by its values 
on the én, n =1,2.... Moreover, for each n we have 


(y(€n41))? = 6™ 


for a unique mp € Z/2"Z as 


2 n n 
((Y@ns))) = CR)?” = En)” = = 1. 


It is not hard to see that mn+ 1 goes to my, under the natural surjection 
Tn. Thus, to each character y : € —> S! there corresponds a sequence 
(m1, me2,...) with mn € Z/2"Z and mn(mn41) = Mn for all n € N (and 
vice versa). The set of such sequences is exactly Zo. 


As any eigenvalue belongs to S!, there is a canonical embedding of 
groups € —> S!. In fact, as things are set up here this embedding is just 
inclusion of subsets of C. 

By duality, this gives rise to a group homomorphism j : Z —> J with 
dense range. This homomorphism induces then an action A of Z on J via 


A:J—J,Ay:=j(Dy. 


It is not hard to see that this A corresponds to Ag if J is identified with 
Zo according to Proposition 3.8. 
Disentangling definitions, we also infer that the action is given by 


(Ay)(k) = k y(k) 


(for y € J and k € €). We denote the arising dynamical system as (J, A). 
It is isomorphic to the binary odometer. By its very construction it is 
what is called a rotation on a compact abelian group (viz the action A 
comes about by multiplication with j(1), where 7 : Z —> J is a group 
homomorphism). Thus, by standard theory (see e.g. [4, 17]) its group of 
continuous eigenvalues is exactly given by the dual of J i.e. by €. 

Now, obviously any eigenfunction of J‘ gives immediately rise to 
an eigenfunction of 2, for the same eigenvalue (by composing with the 
factor map). As the factor map is continuous, we obtain in this way 
continuous eigenfunctions to each of the elements from €. Minimality 
easily shows that (up to an overall scaling) each of these eigenfunctions 
is unique. Thus, we obtain a family of continuous eigenfunctions. At this 
point it is not clear that all continuous eigenvalues of ({2;, T) belongs 
to €. However, this (and more) will be shown later. 
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We can use the preceding considerations to introduce a closed 
equivalence relation © on 9; via 


oo! <=> fo) =f’) 
for all eigenfunctions corresponding to eigenvalues from €. 


Then, 922;/ © is a compact topological space when equipped with the 
quotient topology. 

Clearly, o © ow! if and only if Tw ~ To’. Thus, the relation ~ is 
compatible with the shift operation. Hence, the quotient 2, / ~ becomes 
a dynamical system with the operation T~ induced by the shift. 

Fix now an wg € @,. As discussed above continuous eigenfunctions do 
not vanish anywhere and the multiplicity of each continuous eigenvalue 
is one. Thus, for each k € € there exists a unique eigenfunction f, to k 
on $2; with f,(wo) = 1. Then, the arising system of eigenfunctions will 
have the property that 


fh fig = fin the: fa = fe 
for all k, ki, kg € €. Thus, any » € @, will give rise to an element of 
J =€E via 
E— Sl kw feo). 
Even more is true and the following result holds. It is well known and 


can be found in various places in the literature. Recent discussions are 
given in [4, 6, 9]. 


Lemma 3.9. The dynamical systems (Q;/ ~,T~) and (J, A) are 
conjugate via the map 


[ol > (kr f(@)). 


In particular, the eigenvalues of (2;/ *,T~) are exactly given by the 
elements of E. 


We now further investigate ~ and provide a characterization of w ~ 
w’. Here, we will again use the special words w), wo, w™ introduced 
above. 


Proposition 3.10 (Characterization +). For w,0' € @, the relation 
wa! holds if and only if one of the following two properties hold: 


/ 
eo=a'. 
e There exist s,s’ € {x,y,z} with s £s' and N €Z withw = To 
and of = TN a), 
Proof. Let w,@' with w 4 ow’ and w © a’ be given. By definition of 
~ and the above construction of the eigenfunctions of (2,, T), we then 
have that 
P(g) = Pa’) 
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for all n € NU {0}. Call this quantity P™. In the remaining part of 
the proof we will identify such a P‘) with its unique representative in 
Ve ane 

As w 4 a’ we infer that one of the sequences (P™) nenuto} or (27+1 — 
P™)neNu{o} must be bounded. (Otherwise, w and w’ would agree on 
larger and larger pieces around the origin and then had to be equal.) 
Assume without loss of generality that (P‘™) is bounded. By restricting 
attention to a subsequence we can then assume without loss of generality 
that P™ = P for all n. After shifting the sequences by P to the left we 
can then assume without loss of generality that P{ = 0 for all n. By 
definition of P, there then exist letters s,s’ € {x, y, z} with 


o=...s\p™... and wo =...s'|p™... 


for all n > 0, where | denotes the position of the origin. This gives, by 
definition of the n-partition that in fact 


o=...p™s\p™ and a! =...p™s'|p™... 


for all n > 0. Thus, we obtain w = w) and w! = ow). As w £ a! we 
infer that s 4 s’. This finishes the proof. 


The previous result shows that the factor map from 22; to 92;/ © is 
one-to-one except on three orbits. This has strong consequences as will 
be discussed next. 

As (8,,T) is uniquely ergodic, there exists a unique T-invariant 
probability measure 4 on §2,;. The operation T then induces a unitary 
operator Uy on the associated L?-space via 


Ur: P(2,A) > P(2,a), Urf =foT. 


Anelement f € L7(,A) (with f # 0) is called a measurable eigenfunction 
tok eS! if Urf = kf. The subshift is said to have pure point spectrum 
if there exists an orthonormal basis of measurable eigenfunctions. From 
the two previous results we immediately infer the following. 


Theorem 3.11. The dynamical system (2,,T) has pure point spec- 
trum and any measurable eigenvalue is a continuous eigenvalue and 
belongs to €. 


Proof. The dynamical system (J, A) has pure point spectrum with all 
eigenvalues being continuous and belonging to € as it is a shift on the 
compact abelian group J which is the Pontryagin dual of €, [81]. As the 
dynamical system ,/ © is conjugate to (J, A) due to Lemma 3.9 it 
has also pure point spectrum with all eigenvalues being continuous and 
belonging to €. 

Now, Proposition 3.10 shows that factor map from 92; to 92;/ * is 
one-to-one except on three countable orbits. This implies that in terms 
of measures the associated L?-spaces are isomorphic. This easily gives 
the desired result. 
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Remark 3.12. The occurrence of pure point dynamical spectrum is a 
key feature in the investigation of aperiodic order. In fact, while there 
is no axiomatic framework for aperiodic order a distinctive feature is 
(pure) point diffraction. Now, pure point diffraction has been shown to 
be equivalent to pure point dynamical spectrum. For the case of subshifts 
at hand this can be inferred (after some work) from [72]. A more general 
result (dealing with uniquely ergodic Delone systems) was then given 
in [58]. The result can even further be generalized to measure dynamical 
systems and even processes [7, 62, 64]. 


The previous theorem implies that (J, A) is exactly the maximal 
equicontinuous factor of (@,, T) (see e.g. [3] for definition). Indeed, one 
of the many equivalent ways to describe this factor is as the dual group 
of the group of continuous eigenvalues. A recent discussion of this and 
various related facts can be found in [4]. Henceforth, we will denote 
the maximal equicontinuous factor of (2,, T) by (Q™**, T™*) and the 
corresponding factor map by z™®*. Then, our findings so far provide the 
following theorem. 


Theorem 3.13 (Factor map onto 27"*"). The three dynamical systems 
(Qr/*, T~), (I, A) and (Q™8*, T™**) are topologically conjugate. The 
factor map 
qr ax . 2: aes, max 
T 


is one-to-one in all points except on the images of the points of the orbits 
of oa, . In these points it is three-to-one. 


Remark 3.14. 


e Minimal systems with the property that their factor map to the 
maximal equicontinuous factor is one-to-one in at least one point 
are known as almost automorphic systems (see e.g. [4] for further 
details). Their study has attracted a lot of attention. As the previous 
result shows, (§2;, T) is an almost automorphic system. In fact, as 
9, is uncountable, the factor map is one-to-one in almost every 
point with respect to the unique invariant probability measure 
A on §2;. This is sometimes expressed as the factor map from 
(2,,T) to its maximal equicontinuous factor is almost-everywhere 
one-to-one. 

e There is an alternative description of the relation ~ for almost auto- 
morphic systems. More specifically, define the proximality relation 
~ by 

o~@ <> inf d(T", T™o) =0, 
neZ 


where d is any metric on 9, inducing the topology. (Due to 
compactness of 2, the relation is indeed independent of the chosen 
metric.) Note that the proximality relation can be thought of as 
describing asymptotic agreement. Then, for almost automorphic 
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systems the proximality relation ~ and the relation ~ agree. This 
can be found in the book of Auslander [3]. A recent discussion is 
given in [4]. In fact, this result is even more general in that one does 
not need almost automorphy but only a weaker condition called 
coincidence rank one. We refrain from further discussion of this 
concept and refer the reader to e.g. [9] for further investigation. We 
just note here that in our situation, equality of ~ and © and the 
characterization of © in Proposition 3.10 imply that sequences that 
are proximal (i.e. asymptotically equal) are in fact equal everywhere 
up to one position. 

e In [79] Vorobets shows that the one-dimensional subshift associated 
to t has the binary odometer (J, A) as a factor with the factor map 
being 1 : 1 in all points except three orbits. He uses this to conclude 
pure point spectrum and unique ergodicity. Our corresponding 
results above for the two-sided subshift can therefore be seen as ana- 
logues to his results. However, our approach is different: It is based 
on n-partition whereas his approach is based on Toeplitz sequences. 


3.4. Powers and the Index (Critical Exponent) of Sub, 


In this section we have a closer look at the structure of Sub;. The main 
focus will be on occurrences of three blocks and the index of words (also 
known as critical exponent). We will use n-partitions in our study in a 
spirit similar to [23, 24]. 

We start by investigating occurrences of almost four blocks. An easy 
inspection of n gives the following lemma. 


Lemma 3.15. The word araxara belongs to Sub. 


The previous result deals with occurrence of a cube (and even a bit 
more) of the special word az. As Sub, is invariant under t this then 
yields the occurrence of many more cubes. This can be used to exclude 
eigenvalues for Schrodinger operators via the so-called Gordon argument. 
This is discussed in [39] for the case at hand. Such an application of the 
Gordon arguments for subshifts coming from substitution goes back to 
[20], see [21] for a survey as well. 

Next we show there are no fourth powers occurring in Sub;. 

Let us recall from the considerations on n-partitions in Section 3.2 
that there exists a sequence 7 7h, .. € {z,y, z}S such that the fixed 
point 7 of t can be written as 


(n) rin) (n) ae 


n= p Pp 


with pl =t"(r;) € {z, y, z} for any n € N U {0}. This way of writing 
is referred to as the n-decomposition of n. Call the sequence 


rp) — 7) . €{2,y, ae 
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the nth derived sequence of n. Note that for any natural number n the 
combinatorial properties of the sequence r) are exactly the same as 
the combinatorial properties of the sequence r™ as t” is injective on 
{x,y,z} and r™ = 1"(r™) holds. 


Proposition 3.16. [39] In the derived sequence r = r™ the letters y 
and z always occur isolated preceded and followed by an x. The letter x 
always occurs either isolated (i.e. preceded and followed by elements of 
{y, 2}) or in the form xxx. In particular, there is no occurrence of rxxx. 
The analogue statements hold for any natural number n for the sequence 
r™ (with x,y,z replaced by t’(x),t(y) and t"(z)). 


Remark 3.17. In terms of information the derived sequences r‘™ 
are as useful as the original sequence. We will base our subsequent 
investigations on the relatively simple properties of the derived sequence 
r© given in the preceding proposition. More information should be 
obtainable from a more detailed study of the derived sequences. 


The n-decomposition of n gives a way of writing 7 as a concatenation 
of the words p™ and elements from {x,y,z}. For example, n can be 
written as 


n = (ara) y(ara)z(axra)y(ax a)xz(a xa)... 
yea 


axa 


where we have put brackets around the p“ = aza. Still, 7 can contain 
further occurrences of p“) as indicated in the preceding formula. More 
generally, it is not true that a p™ occurring somewhere in n is in fact 
one of the words p“™ appearing in the n-decomposition of n. However, 
it turns out that whenever p\™ sp™ occurs in n then both of its p‘” 
actually stem from the n-partition. In this sense, there is some form of 
alignment. This is the content of the next proposition [39]. 


Proposition 3.18 (Alignment of the p([™sp™). Consider a natural 
number n and s € {x,y,z}. If p™sp™ occurs in n at the position | 
(1.€. MINA +- M4 [psp™|-1 = p™ sp™ holds), then | is of the form 
14+ k2"+1 for some k € NU {0}. This means that if p™sp™ occurs 
somewhere in n then both of its words p™ actually agree with blocks 
p™ appearing in the n-decomposition n = pe pM 8 pO, a3 


If w is a finite word in Sub; and v is a prefix of w and N is a natural 


number we define the index of the word w in w v by N+ ia and denote 


it by Ind(w, w v). We then define the index of the word w by 
Ind(w) := max{Ind(w, wrv):v prefix of w, N EN, wroe Sub;}. 


As our subshift is minimal and aperiodic the index of every word can 
easily be seen to be finite. 
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Theorem 3.19 (Index of 2,). (a) For every w € Sub, the inequality 
Ind(w) < 4 holds. In particular, n does not contain a fourth power. 
(b) We have 4 = sup{Ind(w) : w € Subz}. 


Remark. The supremum over all the indices is sometimes known as 
the critical exponent. 


Proof. A proof can be found in [39]. Here, we only sketch the idea. By 
Lemma 3.15 the word w = araraxa = va (with v = ax) belongs to 
Sub,. For each n € N we then have 


t™ (va) = (t(v))3"(a) = pr? (x)p™ r(x) p™ r(x) p(n) 


and we infer that the index must be at least 4. Thus, it suffices to show 
that 7 does not contain a fourth power. To do so, it suffices to consider 
occurrences of powers of words w in n. For short words the statement 
can easily be checked. Consider now the case |p| + 1 = 2"+! < |Jw| < 
|p'"+)| for some n > 1. Assume that www occurs in Sub;. Then, the 
proposition on alignment gives that the length of w is given by |w| = 
|p\™|+1=2"+!, This then easily implies the desired statement. 


Remark 3.20. The proof of the theorem shows that the length of any 
word w € Sub; whose cube www also belongs to Sub; is given by 2” for 
some n EN. 


3.5. The Word Complexity of Sub, 


In this section we present a result on the word complexity of Sub;. 
A detailed proof can be found in [40]. 
We define the word complexity of the subshift (Q,, T) as 


C:NU{0}—>N C(L) = number of elements of Sub; of length L. 


Recall that a word w € Sub; is called right special if the set of its 
extensions 


{s € {a,z,y,z}: ws € Sub;} 
has more than one element. 


Theorem 3.21 (Complexity theorem). (a) For any n > 2 and L = 
2° +k withO<k <2” we have 


3:0<k<2r-1 


C(L+1)—-C(L) = eee <k <2”. 


(b) The complexity function C satisfies 
C(1) = 4,C(2) = 6,C(3) = 8 
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and then for any n> 2 and L=2"+k withO<k <2" 


gntl 4 on-11 3b:0<k <2”! 
gntl on 4 op: gn-l< kf <Q”, 


(c) Consider n > 2 and L= 2"+k withO<k <2”. 


e If0<k <2"~!, then there exist exactly two right special words of 
length L. These are given by the suffix of p™ of length L (which 
can be extended by x,y,z) and the suffix of p%—?1"—?(x)p"-)) of 
length L (which can be extended by t”—2(x) and by t™1(2)). 

e [f2"-1 <k <2"~!, then there exists exactly one right special words 
of length L. This is given by the suffix of p™ of length L (which can 
be extended by x,y,z). 


Proof. The proof relies on a detailed investigation of the n-partition 
of n. This allows one to directly determine all words of length |p‘ | = 
2°+1 _ 1 in Sub; and this gives C(2"+! — 1) for all n € NU {0}. At 
the same time the study of the n-partition allows one to obtain a lower 
bound on the difference C(Z+ 1) — C(L) for all LEN. This in turn gives 
a lower bound on C. Combining the lower bound and the precise values 
we obtain the statements of the theorem. 


Remark 3.22. As Sub; is closed under reflections by Proposition 3.5 
the above statements about right special words easily translate on 
corresponding statements about left special words. (Here, a word w € 
Sub; is called left special if the set {s € {a,z,y,z} : sw € Sub;} has 
more than one element.) This shows in particular that the words p™, 
n € NU {0}, are both right special and left special (and are the only 
words with this property). 


3.6. Generating the Fixed Point 7 by an Automaton 


In this section we present an automaton that generates the fixed point 
n of the substitution t. This is well in line with general theory on 
how to exhibit fixed points of substitutions by automata, see e.g. the 
monograph [2] to which we also refer for background on automata. There 
are numerous applications of automatic sequences in group theory. For a 
recent example and a list of further references we refer the reader to [41]. 

Consider the automaton A from Figure 10.1. It is an automaton over 
the alphabet {0,1} with four states qo, q1, q2, 93, which are labeled by 
a,2Z,y, 2 respectively. Then, the infinite sequence 


Ag : NU {0} — {a, 2, y, 2} 


generated by the automaton with initial state qo is defined as follows: 
Write n € NU {0} in its binary expansion as 


n= m2' + 22° 14... 4+ a 12+ 4% 
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<@™ 
© - Fae | 
9.9 


Figure 10.1 The automaton generating 7 


with 7 € NU {0} and a; € {0,1}, 7 = 0,...,%. Consider now the path py, 
in the automaton starting in qo and following the sequence 2021 ... X; 
Then, Ag (n) is defined as the label of the state where this path ends. 


Theorem 3.23. The fixed point n of t agrees with Ag, (where the fixed 
point is considered as a map from N U {0} to {a, 2, y, z}). 


This theorem is an immediate consequence of the next proposition. To 
state the proposition we will need some further pieces of notation. For 
each n € NU{0} and each state q of the automaton we define f“”(q) to be 
the word over {a, x, y, z} of length 2” obtained in the following way: Let 
UL,---, van be the list of all words of length n over {0, 1} in lexicographic 
order (where 0 < 1). Consider now for each k = 1,...,2" the path in 
the automaton starting at q and following the word vz. Then, the kth 
letter of f(”)(q) is defined as the label of the state where this path ends. 

Define the letters 


x: n=k,k €NU {0} 
Sp=t (eo =f yo : n=3k+1,ke NU {0} 
z : n=3k+2,k eNU {0} 
and recall the recursion formula 


(MHD = pM 5 9() 


p SnP 
with p™ := t™(a). 

Proposition 3.24. For each natural number n and each i = 0,1,2,3 
we have 


FTO (Gi) =p sn 4i- 


Proof. This is proven by induction. The case n = 1 follows by 
inspection. Assume now that the statement is true for some n > 1 and 
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consider n + 1. Then the lexicographic ordering of the words of length 
n+ 2 over {0,1} is given by 


Ov, aa Ovgn+1, lvy, 135, 1, Ugnt1, 


where v1,..., Ugnt+1 is the lexicographic ordering of the words of length 
n+ 1 over {0,1}. Thus, from the rules of the automaton we obtain for 
each i = 0,1,2,3 


FOP) a) = FO Cao fe" (Qu, 


where we set qg4 = qj. From our assumption for n and the recursion we 
then find 


f%2) (q,) = p™ snp™ snrr4+é = per gus 


for each 7 = 0,1, 2,3 and this is the desired statement. 


3.7. Replacing t by a Primitive Substitution 


The substitution t arises naturally in the study of Grigorchuk groups G 
and its Schreier graphs (see below). From the point of view of subshifts it 
has the (slight) disadvantage of not being primitive. It turns out that it 
is possible to find a primitive substitution € with the same fixed point— 
and hence the same subshift—as t. This can then be used to obtain 
alternative proofs for the (proven above) linear repetitivity and pure 
discreteness of the spectrum. This is discussed at the end of this section. 
The material presented here was pointed out to us by Fabien Durand [31]. 
Consider the substitution ¢ on the alphabet {a, 2, y, z} with 


$(a) = ax, O(y) = ay, O(y) = az, 6(z) = ar(= F(a). 


As any letter of the alphabet is contained in ¢4(s) for any letter s, this 
is a primitive substitution. 

To relate it to tT we use (again) for n € NU {0} the letters s, = r(x) 
as well as the recursion formula pt) = p(™ s,p™ with p™ := t™(a). 


Proposition 3.25. For any natural number n the equality 
"(a) = 0" *(a)sn-1 
holds. In particular, the fixed point n of t agrees with the fixed point of ¢. 


Proof. This is shown by induction. The cases n = 1,2,3 are easily 
checked by direct inspection. Assume now that the statement is true for 
all integers up to some n > 3. Then, we can compute 


em la) =o" (ar) =o" (ae™(a) =o" (a)e™ (ay) 
=.= e™(aye™ 1 (aye™-*(a)e" 9 (ae). 
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By ¢(a) = az we then find 
Ora) eae ae” ae") 


From our assumption on n we then infer 


cha) =e" (aap tsp” @shaat™ “ans: 


Successive application of the recursion formula and the fact that syn_3 = 
Sn, then give 
PT (a) = T° 1 (a) 8-10"? (a) 8-2(T" 3 (a) Sp_3t” 9 (a) Sn—3 
= 0" *(a)8n—1t” 7 (a)Sn—2T” *(4) 813 
= 1" 1(a)8_-10" (a) 8n—3 
=T"(a)sn_-3 


=T"(a)Sn. 


This is the desired statement. 


Corollary 3.26. The subshift (2¢,T) generated by the primitive 
substitution € agrees with the subshift (827, T). 


As it is well known see e.g. [27, 30], that subshifts associated to 
primitive substitutions are linearly repetitive, an immediate consequence 
of the previous corollary is that (2;, T) is linearly repetitive. Also, as ¢ 
has constant length (i.e. the length of ¢(t) is the same for any letter ¢) 
and ¢(¢) starts with a for any letter t, we can apply a result of Dekking 
[29] to obtain purely discrete spectrum. 


4. Background on Graphs and Dynamical Systems 


In this section we recall some basic notions and concepts from the theory 
of graphs, dynamical systems, and Schreier graphs. In the next section 
we will meet all these abstract concepts in the context of a particular 
group. 

Here we first recall some terminology from the theory of graphs 
and introduce the topological space of (isomorphism classes of) rooted 
labeled graphs. 

Let B be a finite nonempty set together with an involution Bb — B, 
bt b. A graph with edges labeled by B is a pair (V, E) consisting of a set 
V and aset EC V x V x B such that (v, w, b) belongs to E if (w, v, b) 
belongs to E. The elements of V are called vertices and the elements of 
F are called edges. Whenever e = (v,w, 0) is an edge, then 6 is called 
the label, v = o(e) the origin and w = t(e) the terminal vertex of the 
edge. We say that there is an edge from the vertex v to the vertex w with 
label b if (v, w, 6) belongs to E. An edge e is said to emanate from the 
vertex v if vu = o(e). The number of edges emanating from a vertex is 
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called the degree of the vertex. An edge of the form (v, v, b) is called a 
loop at v (with label b). 

We will need the combinatorial distance on a graph given as follows. 
Each vertex has distance 0 to itself. The distance between different 
vertices v and w is 1 if and only if there exists a label b such that (vu, w, b) 
belongs to E. More generally the distance between different vertices v 
and w is then defined inductively as the smallest natural number n such 
that there exists a vertex v’ with distance n—1 to v and distance 1 to w. 
If no such n exists, the combinatorial distance is defined to be oo. The 
graph is called connected if the distance between any two if its vertices 
is finite. Likewise the connected component of a vertex is the set of all 
vertices with finite distance to it. 

A ray in an infinite graph is an infinite sequence vo, v1,... of pairwise 
different vertices with distance 1 between consecutive vertices. Two rays 
are equivalent if there exists a third ray containing infinitely many 
vertices of each of the rays. An equivalence class of rays is called an end 
of the graph. For example, finite graphs have 0 ends, the Cayley graphs 
of Z, Z? and F»2 (the free group on two letters) have 2,1 and infinitely 
many ends, respectively. In general, the Cayley graph of a group may 
have 0, 1,2 or infinitely many ends. 

A rooted graph is a pair consisting of a graph and a vertex belonging 
to the vertex set of the graph. This vertex is then called the root. 

Two rooted graphs (Gj, v1) and (G2, v2) labeled by the same set B 
are called isomorphic if there exists a bijective map § from the vertices 
of G, to the vertices of Go taking v, to vg such that the vertices x and 
y in G1 are connected by an edge of color b if and only if their images 
in V(G2) are connected by an edge of color 6. In this case we write 
(G1, v1) = (G2, v2). 

Let us now consider the set G,(B) of isomorphism classes of connected 
rooted graphs labeled with elements from 6 that we endow with the 
following natural metric. The distance between the isomorphism classes 
of the two rooted graphs (Yj, v1) and (Y2, v2) is defined as 


1 
dist(((Y1, v1)], [(Y2, v2))) := int | : By, (11,7) = By, (v2, nj 


where By(v,r) is the (labeled) ball of radius r centered in v in the 
combinatorial metric on Y. If we only consider graphs of uniformly 
bounded degree (as we will in this chapter), the space G,.(B) is compact. 

We now turn to dynamical systems. Whenever the group H acts on 
a compact space Y via the continuous map a: H x Y —> Y we call 
(Y,H,a) a dynamical system. We will mostly suppress the a in the 
notation. In particular, we will write (Y,H) instead of (Y,H,qa) and 
we will write ty for a¢(y) (with y € Y and t € H). If Z is the infinite 
cyclic group Z, then the action of H on Y is determined by T := a(1) 
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and we then just write (Y, T) instead of (Y,Z) (and this is well in line 
with the notation used in the previous sections). 
Whenever a dynamical system (Y, H) and y € Y are given we call 


{ty: t € H} 


the orbit of y (under H). 

The dynamical system (Y, H) is minimal if every orbit is dense in Y. 
The dynamical system (Y,H) is uniquely ergodic if there exists exactly 
one H-invariant probability measure on Y. 

The dynamical system (Y,H) is called a factor of the dynamical 
system (Y’, H) if there exists a continuous surjective map x : Y’ —> Y 
with x (ty) = tx(y) for all t € H and y € Y’. This map x is then called 
a factor map. The dynamical system (Y, H) is then also referred to as 
an extension of the dynamical system (Y’, #). 

We will deal with graphs arising from dynamical systems. More 
specifically, consider a group H generated by a symmetric finite set 
S and assume that H acts on a compact space Y. Then any point 
y € Y gives rise to the orbital Schreier graph of y denoted by ly. This 
is a rooted graph labeled by $, which is equipped with the involution 
S —> S,st+ s~!. The set of vertices of Iy is given by the points in the 
orbit of y. The root is given by y and there is an edge from v to w with 
label s € S' if sx = y. Note that by the required symmetry of S we then 
also have an edge from w to v with label s~! (as is needed according to 
our definition of a labeled graph). 

As an example we note the Cayley graph of a group G with generating 
set S. This Cayley graph (with the neutral element of G as the root) 
is the orbital Schreier graph of G corresponding to the action of G on 
itself via left multiplication. 

If the elements of S happen to all be involutions then there will be an 
edge of label s from v to w if and only if there is an edge of label s from 
w to v. This is the situation we will encounter in the next section. 


5. Grigorchuk’s Group G, its Schreier Graphs and the 
Associated Laplacians 


In this section we introduce the main object of our interest: the first 
group of intermediate growth G and the Laplacians on the associated 
Schreier graphs. The group G is the first group with intermediate word 
growth and was introduced by the first author in [36, 37]. By now it is 
generally known as Grigorchuk’s group G and this is how we will refer 
to it.! It is generated by four involutions a, b, c,d. With notation to be 
introduced presently, the group G can be viewed as a group acting by 
automorphisms on the full infinite binary tree 7. The action extends 


1 This is in spite of the first author’s reluctance. 
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by continuity to an action by homeomorphisms on the boundary a7. 
The action of G gives rise to Schreier graphs. 

The Schreier graphs arising from actions of automorphism groups of 
infinite regular rooted trees have attracted substantial attention in recent 
years [28, 43, 70]. Of particular interest are so-called self-similar groups 
(as G) whose action reflects the self-similar structure of the tree. Their 
Schreier graphs also have self-similarity features. Some are closely related 
to Julia sets [11, 28, 70]; others are fractal sets close to e.g. the Sierpinski 
gasket or the Apollonian gasket [10, 44, 45]. 

The spectra of the Laplacians on such graphs have been described in 
some cases [10, 42, 44-46]. The spectrum can be a union of intervals [42], 
a Cantor set [10], or a union of a Cantor set together with an infinite 
set of isolated points that accumulate to it, as in the case of the so- 
called Hanoi tower group [45]. These investigations all use the method 
introduced in [10]. Here we study the spectra of the Laplacians on the 
Schreier graphs by a different method via the connection to aperiodic 
order. This allows us to determine the spectral type of the Laplacians 
for arbitrary choice of weights on the generators of the group. We thus 
obtain one of the rare examples where we understand how the spectrum 
of the Laplacian depends on the weights. In general this dependence is 
very poorly understood, as well as the dependence of the spectrum on 
the generating set in the group. Let us mention here a recent result of 
Grabowski and Virag who show in [35] that there is a weighted Laplacian 
on the lamplighter group that has singular continous spectrum, whereas 
the unweighted Laplacian on the same generators is known to have pure 
point spectrum [46]. 


5.1. Grigorchuk’s Group G 


Let us denote by Tg, ¢ € N with g > 2, the rooted regular tree of degree q. 
The vertex set of Jy is given by {0,...,q—1}*, i.e. the set of all words 
over the alphabet {0,...,q—1}. The root of 7g is the empty word. There 
is an edge between v and w whenever w = vk or v = wk holds for some 
k € {0,...q—1}. The words w € {0,...,q—1}” constitute the nth level 
of the tree. (In the tree, they are at combinatorial distance exactly n 
from the root.) 

The boundary 07, of 7g consisting of infinite geodesic rays in Tg 
emanating from the root (i.e. infinite paths starting in the root all of 
whose edges are pairwise different) can then be identified with the set 
{0,1,...,q — 1} of one-sided infinite words over {0,1,...,q — 1}. As 
mentioned above, the set {0,1,...,q—1}% is equipped with the product 
topology and is thus a compact space homeomorphic to the Cantor set. 

Any automorphism of 7, necessarily preserves the root (which is the 
only vertex with degree q) and maps paths starting in the root to paths 
starting in the root. This readily implies that any automorphisms group 
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action on Tq is level preserving, i.e. maps words of length n to words of 
length n. Any such action then extends to an action of the same group 
by homeomorphisms on the boundary 079. 

A regular rooted tree is a self-similar object. Indeed, the subtree rooted 
at an arbitrary vertex of the tree is isomorphic to the whole tree 77. The 
full group of automorphisms inherits this self-similarity property in the 
following sense: any automorphism of 7, is completely determined by 
the permutation it induces on the g branches growing from the root (an 
element of Sym(q)) and the collection of g automorphisms (go, ... , 9q—1) 
which coincide with the restrictions of g on the corresponding branches. 

However, if one is interested in a subgroup H < Aug(Ty) and wants it 
to be self-similar, one has to impose the condition that all the restrictions 
(90,--+,9q—-1) are again elements of the same group H, so that every 
g € H can be represented as 


g = a (90, re erigrag) Gq—-1); 


where a belongs to Sym(q) and describes the action of g on the first 
level of Tg and gj € G,i =0,...,q—1 is the restriction of g on the full 
subtree of 7, rooted at the vertex i of the first level of Ty. This leads to 
the following definition. 


Definition 5.1. A group H of automorphisms of 7, is self-similar if, 
for all g € H,z € {0,...,q—1}, there exist h € H,y € {0,...,q—1} such 
that 

g(tw) = yh(w), 
for all finite words w over the alphabet {0,...,q— 1}. 

We refer the interested reader to [43, 70] for more information about 
self-similar groups. 

We now turn our attention to one particular example of a self-similar 
group that will be the central object of our study, the Grigorchuk 
group G. It is generated by four automorphisms a, b,c, d of the rooted 
binary tree 7 = 72 as follows: 

a(Ow) = lw, a(_w) = Ow; 

b(0w) = Oa(w), bw) = 1c(w); 

c(Ow) = Oa(w), c(1lw) = 1d(w); 

d(Ow) = Ow, dw) = 1b(w), 
for an arbitrary word w over {0,1}. These automorphisms can also be 
expressed in the self-similar form, as above: 


a = €(id, id), b= e(a,c), c= e(a,d), d = e(id, dD), 


where e and € are, respectively, the trivial and the nontrivial permuta- 
tions in the group Sym(2). 
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Remark 5.2. Observe that all the generators are involutions and that 
{1, b, c, d} commute and constitute a group isomorphic to the Klein group 
Z/2Z x Z/2Z. Let us also mention that there are many more relations 
and the group is not finitely presented. 


For our subsequent discussion it will be important that G acts 
transitively on each level, i.e. for arbitrary words w,u over {0,1} with 
the same length there exists a g € G with gu=w. 


5.2. The Schreier Graphs of G and the Dynamical 
System (X, G) 


The action of G on the set V = {0,1}* of vertices of the rooted binary 
tree and on its boundary 87 = {0,1}N induces on this set the structure 
of a graph labeled with {a, b, c,d} C G. Specifically, the vertex set of 
this graph is given by {0, 1}*U{0, 1}N and there is an edge with label s € 
{a, 6, c, d} and origin v and terminal vertex w if and only if sv = w holds. 
Note that the set of arising edges has indeed the symmetry property 
required in our definition of a labeled graph as any s € {a,b,c, d} is 
an involution. For the first three levels of the tree the resulting graphs 
are shown in Figure 10.2. Thus, the rooted graphs consisting of such a 
connected component together with a root are orbital Schreier graphs 
(in the sense defined above). The connected components corresponding 
to the first three levels of the tree are shown in Figure 10.2. 

There are two kinds of such connected components viz finite and 
infinite components. The finite ones correspond to the levels of the tree 
(recall that the action of G on the levels of the tree is transitive). We 
will refer to them simply as Schreier graphs. The infinite ones correspond 
to the orbits of the action on the boundary. The corresponding rooted 
graphs will be referred to as orbital Schreier graphs. 

We will need the isomorphism classes of Schreier graphs and therefore 
introduce the map 


F: V(T)U0T —> Gx (fa, b,c, d}), F(v) = [v, v)], 


where I, is the connected component of v. 

The graphs [ and I, coincide (as nonrooted graphs) whenever v 
and w are in the same orbit of the action of G. In particular, as G acts 
transitively on each level of the tree, for n € N we can therefore define 


oe 


which coincides with I, for all w € V(7) with |w| = n. In general, the 
graph I’, has a linear shape; it has 2”~! simple edges, all labeled by a, 
and 2”~! — 1 cycles of length 2 whose edges are labeled by b,c, d. It is 
regular of degree 4, with one loop at each edge. The loop contributes 1 
to the degree of the vertex because all generators are elements of order 2, 
and the labeling of the loop is uniquely determined by the labeling of the 
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Figure 10.2 The finite Schreier graphs of the first, second, and third level 


other edges around the vertex, as edges around one vertex are labeled 
by {a, b,c, d}. 

The graphs I:, € € 07 corresponding to the action of G on the 
boundary are infinite and have either two ends or one end. The graph 
Ij corresponding to the orbit of the rightmost infinite ray, is one-ended 
(see Figure 10.3), and so are then all graphs in the same orbit. 

All the other graphs Tg, ¢ G-1°, are two-ended. They are all 
isomorphic as unlabeled graphs. 

In [80], Vorobets studied the closure ¥ (87) of the space of Schreier 
graphs in the space of isomorphism classes of rooted labeled graphs. We 
recall some of his results next. He showed that the one-ended graphs 
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Figure 10.4 Connecting two copies of yo 


are exactly the isolated points of this closure F(d7 ), and that the other 
points in F(07) are two-ended graphs. This suggests we consider the 
compact set 


X := F(0T) \ {isolated points}. 


The group G acts on X by changing the root of the graph. The arising 
dynamical system is denoted as (X,G). It is minimal and uniquely 
ergodic and its invariant probability measure will be denoted as v. 

A precise description of X can be given as follows. The space X 
consists of all isomorphism classes of two-ended rooted Schreier graphs 
{z,&) : € € 07 \ G- 1°} and of three additional countable families 
of isomorphism classes of two-ended graphs. These families are obtained 
by gluing two copies of the one-ended graph T's,& € G- 1°, at the root 
in three possible ways corresponding to choosing a pair (b,c), (b, d) or 
(c, d@) and then choosing an arbitrary vertex of the arising graph as the 
root. One of these three possibilities is shown in Figure 10.4. There, the 
chosen pair is (c,d) and to avoid confusion with other edges with the 
same labels, labels at the gluing point are denoted with a prime. 

The decomposition of X into isomorphism classes of the (/:,&) and 
the three families mentioned above immediately gives rise to a factor 
map 


o:X > dT, 
which is one-to-one except in a countable set of points, where it is three- 


to-one. In fact, the inverse map ¢~! exists on the complement 97 \ G- 1° 
of the orbit of the point 1° and agrees there with F. 
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Under this factor map @ the unique G-invariant probability measure 
v on X is mapped to the uniform Bernoulli measure pz on dT = {0, 1}. 


5.3. Laplacian Associated to the Schreier Graphs of G 


Whenever a group H acts on a measure space (Y,m) by measure 
preserving transformations, one obtains the Koopman representation @ 
of H on L?(Y,m) via 


o(h) : L?>(Y,m) — L?(Y,m), (ef)(y) = fiat y), 


(for h € H). Any @(h) is then a unitary operator (as the action is measure 
preserving). 

In our situation when H = G, we have moreover that for s € {a, b, c, d} 
the unitary operator @(s) is its own inverse (as s is an involution) 
and hence must be self-adjoint. In particular, for any set of parameters 
t,u,v,w € R we obtain a self-adjoint operator 


Mo(t, u,v, w) = te(a) + ue(b) + ve(c) + we(d). 


Consider first an arbitrary € € 07. Then, there is an action of G 
on the (countable) vertex set of V(Iz) of Iz. Specifically, s € {a, b, c, d} 
maps the vertex « € V(I¢) to the vertex sx, which is the unique vertex of 
V (Ie) connected to v by an edge of label s. Clearly, this action preserves 
the counting measure on V (J). Thus, we obtain a representation @¢ of 
G on £7(V(I¥)). 


Definition 5.3 (Laplacian on the Schreier graph). An operator 
Mz (t, u,v, w) defined by 


Mg(t, u,v, w) = Mo, (t, u,v, w) = tog (a) + uae (b) + vae(c) + woe (d) 


with € € 07 and t, u,v, w € R will be called (weighted) Laplacian on the 
Schreier graph Ve. 


Remark 5.4. 


e It is possible to understand V(t) as G/Gs, where Gs is the 
stabilizer of € in the action of G on 07, and then g¢ is the quasi- 
regular representation QG/G, associated to G/G¢. 

e If t,u,v,w are positive with 1 = t+u+vu+_w then it is possible 
to interpret the operators Mz as the Markov operators of a random 
walk on the graph I. In the general case, the operator Mz (t, u, v, w) 
can still be seen as the natural weighted “adjacency matrix” or 
“Laplacian” associated to the the graph I>. 


We can also equip 87 = {0,1} with the uniform Bernoulli measure 
w and consider the Koopman representation 1 of G on L?(dT, ) given 
via 


n(g): L2(8T, 4) — L(0T, uw), x(g)f (2) = f(g7'2). 


ROSTISLAV GRIGORCHUK, DANIEL LENZ, AND TATIANA NAGNIBEDA 288 


This is a unitary representation of G and any z(s), s € {a, b,c, c}, is 
a unitary self-adjoint involution. For t,u,v,w € R we then obtain the 
operator M, (t, u,v, w) via 


M(t, u,v, w) = tr(a) + um(b) + va (c) + wr (d). 


The following is a crucial result on the spectral theory of the above 
operators. 


Theorem 5.5 (Independence of spectrum (Bartholdi / Grigorchuk 
[10])). For any given set of parameters t,u,v,w € R the spectrum 
of Mg(t,u,v,w) does not depend on € € dT and coincides with the 
spectrum of M(t, u,v, w). 


Of course, there is the question what the spectrum is in terms of the 
parameters t,u,v,w. A complete answer was given in [10] in the case 
u = vu = w. The spectrum then consists of two points or one or two 
intervals, and an explicit description of the spectrum can be given in 
terms of the parameters ¢ and u = v = w. In fact, the case u = v = w is 
the case of periodic Schrédinger-type operators and can easily be treated 
by classical means (Floquet decomposition). It can also be treated by the 
method suggested in [10]. Below we will come to a main result of [39], 
which treats the case, where u = v = w does not hold. We will see that 
in this case, the spectrum is a Cantor set of Lebesgue measure zero. 


6. The Connection between (X,G) and (2;, T) 


We will now provide a map gr from £2; to (isomorphism classes of) 
labeled rooted graphs with labels belonging to the alphabet {a, b,c, d}. 
To get a better understanding of this map it will be useful to think of 
the letters x, y, z as encoding the pairs 


(-): (1): () 


respectively. Roughly speaking the map will convert w € Q, into a graph 
with vertex set Z and root 1 and edges between n and n+ 1 with labels 
from {a,b,c,d} according to the value of w(n). We now provide the 
precise definition of the labeled rooted graph gr(w) associated to 
w € £2, as follows: 


Vertices: The set of vertices is Z. 
Root: The number 1 is chosen as the root. 


Edges: There are edges between vertices n, k if and only if |n—k| < 1. 
Specifically, edges are assigned between n and n+1 and from n to itself 
and from n+ 1 to itself in the following ways: 


e If w(n) = a, then there is an edge between n and n+ 1 and this 
edge carries the label a. 
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e If w(n) = «, then there are two edges between n and n+ 1; one 
carries the label b and the other carries the label c. Moreover, there 
is an additional edge from n to itself labeled with d and an additional 
edge from n+ 1 to itself labeled with d. 

e If w(n) = y, then there are two edges between n and n+ 1; one 
carries the label 6 and the other carries the label d. Moreover, there 
is an additional edge from n to itself labeled with c and an additional 
edge from n+ 1 to itself labeled with c. 

e If w(n) = z, then there are two edges between n and n+ 1; one 
carries the label c and the other carries the label d. Moreover, there 
is an additional edge from n to itself labeled with b and an additional 
edge from n+ 1 to itself labeled with b. 


We note that in this way any vertex has for each label {a, b, c, d} exactly 
one edge of this color emanating from it. We also note that the arising 
graphs have a “linear structure” in a natural sense. 

Let G({a, 6,c,d}) be the metric space of isomorphism classes of 
connected rooted graphs labeled with elements from {a, b,c, d}. Then, 
gr gives rise to a map Gr via 


Gr : 2, — G,({a, b, c, d}),@ + [gr(@)], 


where [(I", v)] is the equivalence class of (7, v) in the space of isomor- 
phism classes of the rooted connected labeled graphs. 

Recall that X denotes the closure of F(7) in G,({a, b, c, d}) without 
its isolated points (see Section 5.2). Then, it turns out that the image 
of 2, under Gr is exactly X. In fact, much more is true and this is our 
main result on the connection of the subshift (,, T) and the dynamical 
system (X, G). 

Define the maps A, B, C, D from Q, into itself by 


e Aw) =...@9@1|o9... if wy = aand A(@) = ....w_1|@9@}... if wo = a. 

e Bia) = ...w9@1|@2... if m1 € {x,y}, Bw) = ....©-1|@0@1... if wo € 
{x,y} and B(@w) =o in all other cases. 

e C(@) = ...w~9@1|@2... if w1 € {2,z}, C(w) = ....w_1|@0@1... if wo € 
{x,z} and C'(w) = @ in all other cases. 

e Dw) = ...@9@1|@2... if w1 € {y, z}, D(w) = ....@_1|@0@1... if wo € 


{y, z} and D(w) =o in all other cases. 


Then, clearly, A, B, C, D are homeomorphisms and involutions. Let H be 
the group generated by A, B, C, D (within the group of homeomorphisms 
of 2r). 


Theorem 6.1 (Factor theorem [39]). The following statements hold: 
(a) The group G is isomorphic to the group H viagogo: G — H with 
o(a) = A, o(b) = B, e(c) = C and e(d) = D. In particular, there is a 
well defined action a of G on 2, given by ag(@) := e(g)(@) forge G 
and w € Q, and via this action we obtain a dynamical system (827, G). 
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(b) The dynamical system (X, G) is a factor of the dynamical system 
(2,,G) with factor map 


Ww: 2, > X,or Gro), 


which is two-to-one. 

(c) For every w € Q, the orbits {Tw : n € Z} and {ag(w) : g € G} 
coincide. 

(d) The dynamical system (@;, G) is uniquely ergodic and the unique 
T-invariant probability measure on 82, coincides with the unique G- 
invariant probability measure on Q2,. 


The theorem provides a factor map  : (82,, G) —> (X, G). In Section 
5.2 we have already encountered the factor map @ : (X, G) —> (07, G). 
Putting together these factor maps provides a tower of extensions of the 
dynamical system (07, G) via 


GOS B63 6no. 


The remainder of this section is devoted to providing additional perspec- 
tive on these two factor maps by discussing their respective merits. We 
will see that ¢ resolves the nontypical points and that w& allows one to 
embed the group G into a topological full group. 

We will need a bit of notation first. Let (Y, H) be a dynamical system. 
Let g € H be given. Then, an y € Y is called g-typical if either gy 4 y or 
g acts trivially in some neighborhood of y. If y € Y is g-typical for any 
g € © it is called typical. Typical points have many claims to be indeed 
typical. For instance, if H is countable the set of typical points has a 
meager complement [38]. Moreover, the following is shown in [38, 43]. 


Proposition 6.2. Let (Y,H) be a minimal dynamical system and H 
finitely generated. Then, the Schreier graphs of typical points are locally 
isomorphic (as labeled graphs). Moreover, if x € Y is typical and y € Y 
is not typical, then any ball around a vertex of Iz is isomorphic to a ball 
around some vertex in Ty. 


In order to get a further understanding of the typical points we 
will next discuss another characterization of typical points. Define the 
stabilizer of y as 


sty(y) = {he H: hy = y} 
and the neighborhood stabilizer of y as 
st?, :={g eH: h acts as identity on a neighborhood of y}. 
Then, st?, is a normal subgroup of sty and the quotient 
germ(y) := sta (y)/st}y 
is called the group of germs (at y). 


SCHREIER GRAPHS OF GRIGORCHUK’S GROUP AND THE SUBSHIFT OF J 291 


Lemma 6.3. Let (Y,H) be a dynamical system. Then, the following 
assertions are equivalent for y€ Y. 


(i) The point y is typical. 
(it) The group of germs germ(y) is trivial. 


Proof. (i)=> (ii): If germ(y) is not trivial, there exists a g € sty \ ate 
Hence, y is not typical. 

(ii)= > (i): If y is not typical then there exists a g € G with gy = y 
but g is not the identity on a neighborhood of y. Thus, germ(y) is not 
trivial (as it contains the class of g). 


Remark 6.4. The points y with nontrivial germ(y) are sometimes called 
singular. 


We will also need the concept, going back to [34], of the (topological) 
full group |[T]] of a homeomorphism T : Y —> Y of the Cantor 
set Y. This is the group of those homeomorphisms of Y that at any 
point coincide locally with powers of T. It is a countable group with 
remarkable properties. For example, it is amenable if T is minimal [52], 
its commutator subgroup is simple [68] and it is finitely generated if and 
only if T is a minimal subshift over a finite alphabet [68]. 

After these preparations we can now come back to the situation at 
hand. 

The dynamical system (07,G) is minimal and uniquely ergodic. 
However, it does have nontypical points. In fact, a point in d7 is 
nontypical if and only if it belongs to G- 1% (see [80]). 

The dynamical system (X,G) is minimal and uniquely ergodic (as 
discussed in [79]). Moreover, all its points are typical (as can easily be 
seen). Thus, from this point of view a key merit of (X, G) is to provide 
an extension of (07, G), which resolves the nontypical points. 

The dynamical system (82,, G) is minimal and uniquely ergodic and 
all its points are typical (as can easily be seen). Moreover, as shown in 
the previous theorem, the dynamical system (2,, G) has the additional 
feature that there exists a homeomorphism T on £2; such that the orbits 
of T agree with the orbits of G. This in turn can be seen to imply that G 
is a subgroup of the topological full group [[T]] of T. In fact, G embeds 
into the topological full group of (Q,, 7), as the action of generators 
a,b,c,d on Q; can be represented locally as the action by T*! and 
T° = id, so G embeds into [[T]]. 


Remark 6.5. In this context we also mention a recent article of Matte 
Bon [67] showing that the group G (and other groups of intermediate 
growth introduced by the first author in [36]) embed into the topological 
full group [[@]| of a minimal subshift @ over a finite alphabet. While 
his approach is different from ours it leads to the same subshift for the 
group G. 
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We finish this section with the question whether there exists a minimal 
homeomorphism S$ on X such that G is a subgroup of [[.]]. 


7. Spectral Theory of the M; for & € 07 


In this section we discuss spectral theory of the operators Mg (t, u, v, w) 
(defined in Section 5.3) for arbitrary values of the parameters ¢, u, v, w. 
We do so by using the results of the previous section to transfer the 
problem of spectral theory of the operators Mg(t, u,v, w), € € dT, to 
the field of the spectral theory of discrete Schrodinger operators with 
aperiodic order, (Hy)wea,- 

We consider the subshift (§2;, 7). In order to define the Schrédinger 
operators we will define specific functions f, g on $2; depending on four 
real parameters t, u,v, w. Given these parameters we set 


D:=utv+tw 


and define 
t : a=a 
ak eee ae Oe D-w : w=z 
nee es DME NR Rae wo = Y 
D-u : w=z 


and 
w: woo, € {ax, xa} 
g: 2, —R by ge) := v:@oo, € {ay, ya} . 
U: @9@1 € {az, za} 


For a given (t, u,v, w) € R* and these f,g we let Hy, w € Q,, be the 
associated operators. 


Proposition 7.1. [39] Let (t, u,v, w) € R* be given. Let & € 9T\G-1% 
be arbitrary. Then, there exists a w in 2, such that Hy, is unitarily 
equivalent to Mz(t, u,v, w). 


As a consequence of the previous proposition we can translate results 
on the spectral theory of Schrédinger operators associated to (827, T) 
into results on the Mz. Recall that the spectrum of the Mé(t, u, v, w) 
does not depend on & € dT (due to Theorem 5.5). Define 


P :={(t,u,v,w) €R*:t£0,ut+v40,u+w40,0+w FO}. 


Theorem 7.2 (Intervals vs Cantor spectrum for the Mg [39]). Let 
(t,u,v,w) € P be given and let X = X(t, u,v, w) be the spectrum of the 
associated family of Laplacians Mg(t, u,v, w), € € AT\G-1%. Then, the 
following holds: 

(a) Ifu=v=vw holds then X consists of one or two closed nontrivial 
intervals and all spectral measures are absolutely continuous. 
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(b) Ifu =v = w does not hold then X is a Cantor set of Lebesgue 
measure zero and no spectral measure is absolutely continuous. 


Proof. Note that (82,;,T) is linearly repetitive due to Theorem 3.1. 
Now, the theorem is a direct consequence of the preceding proposition 
and Theorem 2.7 and Theorem 2.9. 


Remark 7.3. The case u = v = w has already been treated in [10] and 
an explicit description of the spectrum (in terms of the value of wu) has 
been given there. 


We can also use the above considerations to translate results on 
absence of eigenvalues from the (H,) to the Mz. Recall that denotes 
the uniform Bernoulli measure on 87 = {0, 1}. 


Theorem 7.4 (Absence of eigenvalues [39]). Let (t,u,v,w) € P be 
given and assume that u =v = w does not hold. 

(a) For -almost every — € dT the operator Mz does not have 
eigenvalues. 

(b) For any x € 6-'(G1%) the operator Mz, does not have eigenvalues. 


Remark 7.5. The considerations of this section are concerned with the 
case (t,u,v,w) € P. For (t,u,v,w) ¢ P the operators in question can 
be decomposed as a sum of finitely many different finite dimensional 
operators each appearing with infinite multiplicity. Thus, the spectrum is 
pure point with finitely many eigenvalues each with infinite multiplicity. 


8. Integrated Density of States and Kesten-von 
Neumann-Serre Spectral Measure 


In this section we show that the Kesten-von Neumann-Serre spectral 
measure actually agrees with the integrated density of states. This result 
is folklore and certainly known to experts. Also, a recent discussion of a 
somewhat more general setting can also be found in [1]. Here, we give a 
direct reasoning for the case at hand based on our approach. 

There is one more representation studied in the context of spectral 
approximation. This representation comes from the action of G on 
the nth level of the tree. This action clearly preserves the counting 
measure on the finite set of vertices of the nth level. It hence gives 
rise to a representation @, of G on the finite dimensional LOVER: 
For t, u,v, w € R we then set 


Mn(t, u,v, w) = Mo, (t, u,v, w) = ton(a) + uon(b) + ven(c) + won(d). 
To each such My (t, u, v, w) we associate the spectral distribution which 
is the measure [17 (t, u, v, w) on R given by 


1 
UL (t, u,v, w) aa AT on, 
7 [Vr dX 
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where the sum runs over eigenvalues FE of M(t, u,v, w) counted with 
multiplicities. In the case u = v = w it is shown in [47] that the measures 
Len, n € N, converge weakly and the limiting measure 


Hoolt, U,v, w) = lim Ln(t, u,v, w) 
n> Co 


is called the Kesten-von Neumann-Serre spectral measure there. Next 
we will show that the limiting measure exists for any values of the 
parameters t, u,v, w and coincides with the integrated density of states 
of the associated Schrodinger operators. 

For given (t, u,v, w) € R* we chose the functions f, g as in the previous 
sections and let H,, w € §2,, be the associated Schrodinger operators. 


Theorem 8.1. Let (t, u,v, w) € R* be given. Forn €N, let Mn(t, u, 
v,w) be the corresponding operator on In, and Un its spectral dis- 
tribution. Then, the sequence of measures (Un(t,u,v,W))n converges 
weakly towards the integrated density of states k of (H,). In particular, 
the integrated density of states of the (H,) agrees with the Kesten-von 
Neumann-Serre spectral measure. 


Proof. The graph arising from restricting gr(w™) to the vertex set 
[1,|z”"(a)|| and the graph Il, differ at most in six loops at the ends (as 
is clear from the definitions). Thus, a simple variant of the argument 
in Proposition 7.1 shows that the restriction of the operator H,.x) to 
[1,|t”(a)|| is a perturbation of Mp(t, u,v, w) of rank at most six. Now, 
Corollary 2.5 gives the convergence of the un(t,u,v,w) toward the 
integrated density of states of (H,). 


A few comments on this result are in order. They are gathered in the 
next remark. 


Remark 8.2. 


e Denote the &-independent spectrum of the operators Mg (t, u, v, w) 
by X(t, u, v, w) and the spectrum of Mn(t, u, v, w) by En(t, u, v, w). 
Clearly, the support of wp is given by p(t, u,v,w). By Theorem 
2.2 the support of the integrated density of states k is given 
by X(t,u,v,w). Thus, the previous theorem gives in a certain 
and very weak sense the convergence of the U,(t, u,v, w) toward 
X(t, u, v, w). In particular, there is an inclusion of spectra as shown 
in Corollary 2.6. 

In the case at hand, however, convergence of the Xp (t, u, v, w) 
toward X(t, u,v, w) holds in a much stronger sense. More precisely, 
the results of [10] give the following: 

(a) En(t, u,v, w) C Unit, u,v, w) for all ne N. 

(b) X(t, u,v, w) = Un En, wu, v, w). 

These are rather remarkable features and not at all true for 
approximation of the integrated density of states of Schrddinger 
operators in other cases. 
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Let us note, however, that there are recent results on approxima- 
tion of spectra of minimal subshifts with respect to the Hausdorff 
distance by suitable periodic approximations [13]. 

It is also worth pointing out that the approximation of the spectra 
is done “from below” i.e. via unions. This is in contrast to other 
approximation schemes used in the investigations of Schrddinger 
operators, where the approximation is done “from above” i.e. via 
intersections. 

The feature presented in the preceding two points also explain that 
the spectrum of M,(t, u, v, w) agrees with X(t, u, v, w). The reason 
is simply that the representation a decomposes as a sum of the 
representations 7, (as shown in [10]). Accordingly, Mz (t, u, v, w) is 
a sum of the finite dimensional operators My(t, u, v, w), n € N, and 
its spectrum is then given as the closure of the union of the spectra 
of the My(t, u,v, w). 
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Abstract 

We prove that random walks on Thompson’s group F driven by strictly 
non-degenerate finitely supported probability measures have a non- 
trivial Poisson boundary. The proof consists in an explicit construction of 
two non-trivial u-boundaries. Both of them are described in terms of the 
‘canonical’ Schreier graph I on the dyadic-rational orbit of the canonical 
action of F' on the unit interval (in fact, we consider a natural embedding 
of F into the group PLF(R) of piecewise linear homeomorphisms of the 
real line, and realize T on the dyadic-rational orbit in R). However, the 
definitions of these u-boundaries are quite different (in perfect keeping 
with the ambivalence concerning amenability of the group F’). The first 
u-boundary is similar to the boundaries of the lamplighter groups: it 
consists of Z-valued configurations on I arising from the stabilization of 
logarithmic increments of slopes along the sample paths of the random 
walk. The second p-boundary is more similar to the boundaries of the 
groups with hyperbolic properties as it consists of sections (‘end fields’) 
of the end bundle of the graph I, i.e., of the collections of the limit ends 
of the induced random walk on I’ parameterized by all possible starting 
points. The latter construction is more general than the former one, and 
is actually applicable to any group which has a transient Schreier graph 
with a non-trivial space of ends. 
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Introduction 


General Setup 


The group F introduced by Richard Thompson in 1965 is the group of 
the orientation preserving piecewise linear dyadic self-homeomorphisms 
of the closed unit interval [0,1]. Arguably, the most important open 
question about Thompson’s group F is the one about its amenability 
(see Cannon-Floyd [CF11] for its history). Indeed, due to the plethora 
of rather unusual properties of this group (described, for instance, 
by Cannon-Floyd—Parry in [CFP96]) either answer would imply very 
interesting consequences. This problem has recently attracted a lot of 
attention, with an impressive number of failed attempts to prove either 
amenability or non-amenability of the group F’. 

We are inclined to believe that Thompson’s group F' is non-amenable, 
and several recent papers provide circumstantial evidence for that. There 
are various computational experiments due to Burillo—Cleary—Wiest 
[BCW07], Haagerup—Haagerup—Ramirez Solano [HHRS15], Elder— 
Rechnitzer-van Rensburg [ERJvR15]. Besides that, Moore [Mool13] 
proved that if the group F is amenable, then its Fglner function must 
be growing very fast (note, however, that, as it was shown by Erschler 
[ErsO6b], even the groups of intermediate growth may have the F¢lner 
function growing faster than any given function). 

The solution of a conjecture of Furstenberg [Fur73] by Rosenblatt 
[Ros81] and by Kaimanovich—Vershik [KV83] provides the following 
characterization of amenability in terms of the Liouville property. 
A countable group is amenable if and only if it carries a non-degenerate 
Liouville random walk (i.e., such that its Poisson boundary is trivial). 
Therefore, a possible approach to proving non-amenability of a given 
group consists in showing that there are no non-degenerate Liouville 
random walks on it. Here we make the first step in this direction for 
Thompson’s group F’ by showing that the random walk (Fw) is non- 
Liouville for any strictly non-degenerate finitely supported measure ju. In 
particular, the simple random walk on the Cayley graph of F determined 
by any finite generating set is non-Liouville. 

Independently of the amenability issue, Thompson’s group F 
was among the first examples of finitely generated groups with an 
intermediate (between 0 and 1) value of the Hilbert space compression 
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which were exhibited by Arzhantseva-Guba-Sapir [AGS06]. They proved 
that for the group F' this value is equal to 1/2 and asked about existence 
of a compression function strictly better than the square root [AGS06, 
Question 1.4]. However, as it has been pointed out by Gournay [Goul4], 
in view of the results of Naor—Peres [NPO8], if such a function exists, 
then the group must be Liouville. In combination with our result, it 
allowed Gournay to answer the above question of Arzhantseva—Guba-— 
Sapir in the negative: the best Hilbertian equivariant compression 
function for Thompson’s group F' is (up to constants) the square root 
function. 


Main Results 


The overwhelming majority of the currently known examples of an 
explicit non-trivial behavior at infinity for random walks on discrete 
groups falls into one of the following two classes. The first class consists 
of the examples, for which this behaviour is due to some kind of boundary 
convergence usually related to more or less pronounced manifestations 
of hyperbolicity (the most representative example being, of course, 
the word hyperbolic groups), see the papers by Kaimanovich—Vershik 
[KV83], Kaimanovich [Kai85, Kai89, Kai94, Kai00], Kaimanovich—Masur 
[KM96], Karlsson—Margulis [KM99], Karlsson—Ledrappier [KLO7], 
Maher-Tiozzo [MT14]. For the examples from the second class a 
non-trivial behavior at infinity is provided by pointwise stabilization 
of random configurations in a certain way associated with the sample 
paths of the random walk (this situation is exemplified by the lamplighter 
groups), see Kaimanovich—Vershik [KV83], Kaimanovich [Kai83, Kai91], 
Erschler [Ers04a, Ers04b, Ersl1], Karlsson—Woess [KWO7], Sava 
[Savl0a], Lyons—Peres [LP15], Juschenko—Matte Bon—Monod-de la Salle 
(JMBNdIS15]. 

In the case of Thompson’s group F’, true to its ambivalent nature, we 
actually construct asymptotic behaviours (u-boundaries) of both types. 
Note that although the descriptions of these «-boundaries are quite 
different, currently we do not know anything about their mutual position 
in the lattice of all 4-boundaries determined by a fixed step distribution 
ju. In particular, a priori it is not excluded that both these u-boundaries 
might in fact coincide with the full Poisson boundary. 

Our main tool is the ‘canonical’ Schreier graph TY of the group F' 
(endowed with the standard generators) on the dyadic-rational orbit. of 
its canonical action on the unit interval. In principle one could argue 
just in terms of this action. However, it turns out to be much more 
convenient to ‘change the coordinates’ and to realize Thompson’s group 
F as a subgroup F & F of the group PLF(R) of piecewise linear 
homeomorphisms of the real line. [Actually, the geometry of the graph 
I completely described by Savchuk [Sav10b] is really begging for this 
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coordinate change.] Then one of the generators of F becomes just the 
translation y  y—1onR. The dyadic-rational orbit in the unit interval 
becomes the dyadic-rational orbit in R, so that we can identify the vertex 


set of T with the dyadic-rational line Z [3 CR. 


As we have already said, we construct two non-trivial ~-boundaries of 
the group F’. The first one is inspired by an analogy between Thompson’s 
group F = F and the lamplighter groups. Let fun(T,Z) (respectively, 
Fun(T, Z)) denote the additive group of finitely supported (respectively, 
of all) Z-valued configurations on the graph T. We assign to any element 
g of the group Fc PLF(R) aconfiguration Cg € fun(I’, Z) on FP = Z [3]. 
The value of Cg at a point y is the logarithmic increment of the slope of 
g at y (so that the support of C, is precisely the discontinuity set of the 
derivative of g). Or, in a somewhat different terminology, the support of 
the configuration Cg is the set of the break points of g, and its values are 
the logarithms of the jumps of g. 

By the chain rule the sequence of the configurations Cy, along a sample 
path (gn) of the random walk (F’, 2) satisfies a simple recursive relation. 
This relation is completely analogous to that for the random walks on the 
lamplighter groups. In precisely the same way as with the lamplighter 
groups [KV83, Kai83], if the measure w is finitely supported, then the 
transience of the induced random walk on P implies that the sequence Cz, 
almost surely converges to a random limit configuration Cog € Fun(T, Z). 
Then it is not hard to verify that the limit configuration Co, cannot be 
the same for all sample paths. Therefore, the space Fun(I, Z) endowed 
with the arising hitting distribution is a non-trivial ~-boundary. 

This argument (once again, precisely in the same way as for the 
lamplighter groups) is hinged on the transience of the induced random 
walk on I’. We establish it by showing that for the simple random walk 


on T SZ [3] there is a ‘drift’ that forces the 2-adic norm to go to 


infinity along the sample paths (this is the original argument we referred 
to in [Kai04a]). The classical comparison technique then leads to the 
transience of all random walks on I driven by strictly non-degenerate 
probability measures uw on F = F. 

Alternatively, the transience of the simple random walk on [ can 
be directly obtained from the explicit description of the geometry of 
the graph I due to Savchuk [Sav10b]. Indeed, after removing from I a 
countable set of rays isomorphic to Z; the remaining skeleton T is a tree 
roughly isometric to the standard binary tree. This fact readily implies 
that T is transient. 

In a recent preprint [Mis15] Mishchenko gives yet another proof of the 
transience of the Schreier graph I’, which is based on an explicit Dirichlet 
norm estimate. He then notices that due to a specific geometry of I’ this 
transience implies existence of a non-trivial behaviour at infinity for the 
simple random walk on I’. Because it is induced by the simple random 


VADIM A. KAIMANOVICH 304 


walk on Thompson’s group F’, the latter also has a non-trivial behaviour 
at infinity. This argument gives a different proof of the absence of the 
Liouville property for the group F’. 

Mishchenko settles just for showing non-triviality of the Poisson 
boundary by exhibiting a non-trivial finite partition of it. However, 
his observation can be developed much further. Indeed, a transient 
finite range random walk on any graph necessarily converges in the end 
compactification. If the graph is endowed with a transitive group action 
that commutes with the transition operator of the random walk, then the 
dependence of this limit end on the starting point is equivariant, so that 
it is the same boundary behaviour of the random walk on the group that 
is exhibited independently of the starting point in the graph. However, 
this is not the case in our situation, and here, in order to obtain a p- 
boundary (i.e., an equivariant quotient of the Poisson boundary), one 
has to consider the collections of the limit ends of the induced random 
walk on I’ parameterized by all possible starting points. In other words, 
the resulting -boundary is realized on the space of sections of the end 
bundle 


rxorof, (y,a)-> y 


over the graph T' (here dT is the space of ends of I). One can consider 
these sections as ‘end fields’ on IT’, or, in yet another terminology, as 
elements of the space Fun(T,dV) = (aF)! of aP-valued configurations 
on I. By using certain self-similar features of the graph I’ (although its 
group of automorphisms is trivial, it has a lot of partial isomorphisms 
between its subsets), it is easy to check that this u-boundary is non- 
trivial. 

Our construction of a 4-boundary of the group F on the configuration 
space Fun(I,Z) heavily used the specific features of this group. On 
the contrary, the above construction of a u-boundary on the space of 
sections Fun(I’, dP) of the end bundle over the Schreier graph I is much 
more general. It produces a non-trivial 4 boundary for a strictly non- 
degenerate finitely supported step distribution yz on an arbitrary finitely 
generated group G whenever there exists a Schreier graph I of this group 
with the following two properties: 


(i) the graph I is transient with respect to the simple random walk; 
(ii) for the induced random walk (I, w) its harmonic (hitting) distri- 
butions on the space of ends oI are not one-point measures. 


Returning to Thompson’s group F = F, let us notice that the space 
of ends of its canonical Schreier graph I splits into a disjoint union 


or = oF Ud_Tua,r. 


Here aI is the space of ends of the skeleton , which can be identified 
with the space 7 = Z2\2Z of 2-adic integers of norm 1. The sets 
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d_T',04I are countably infinite and correspond to the rays attached 
to the respective subsets of the skeleton. The components of the above 
decomposition, on which the hitting measures are actually concentrated, 
can be further described in terms of the barycentres of the images of the 
measure ,4 under its homomorphisms to Z. In particular, in the centered 
case the hitting measures are supported by Fun(I, dP). 


Structure of the Chapter 


The chapter has the following structure. In Section 1 we remind the 
reader of the background definitions concerning the Poisson boundary of 
a random walk on a discrete group, the Liouville property and its links 
with amenability. We also remind the reader of how one obtains a non- 
trivial behaviour at infinity for the lamplighter groups, as this was our 
source of inspiration for constructing one of the u-boundaries of Thomp- 
son’s group (the one realized on the configuration space Fun(I, Z)). 

Further, in Section 2 we introduce Thompson’s group F' and describe 
its realization as a subgroup F & F of the group PLF(R) of piecewise 
linear homeomorphisms of the real line. 

In Section 3 we establish the transience of random walks on the 
Schreier graph I’. First we do that for the simple random walk (Theorem 
3.2), and then, by using a general comparison criterion (Theorem 3.4), 
for the random walk determined by any strictly non-degenerate step 
distribution (Theorem 3.7). 

In Section 4 we construct a non-trivial -boundary of the random 
walks on Thompson’s group F' = F, which is determined by stabilizing 
Z-valued configurations on the Schreier graph I. It is realized on the 
configuration space Fun(I, Z) (Theorem 4.4). 

In Section 5 we establish several geometric properties of the Schreier 
graph T and its space of ends. 

These properties are then used in Section 6 to construct another non- 
trivial w-boundary of random walks on Thompson’s group F = F. It 
is determined by the convergence of the induced random walk on the 
Schreier graph I to its ends, and it is realized on the space of sections 
Fun(I, a0) of the end bundle over I (Theorem 6.1). In Theorem 6.2 we 
generalize this result to arbitrary groups that admit a transient Schreier 
graph with a non-trivial space of ends. In Theorem 6.3 we further specify 
the components of the space Fun(I, dl’), which actually support the p- 
boundary constructed in Theorem 6.1. 

Finally, Section 7 contains a discussion of possible future developments 
and ramifications. 


Historical Comments 


The main result of this chapter (absence of the Liouville property for 
Thompson’s group F’) was presented in the talks ‘Boundary Behaviour 
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of the Thompson Group’ at the conference ‘Combinatorial, Geometric, 
and Dynamical Aspects of Infinite Groups’ (1-6 June 2003, Gaeta, Italy) 
and at the special session ‘Probabilistic and Asymptotic Aspects of 
Group Theory’ of the AMS meeting in Athens, Ohio (26-27 March 
2004), and the slides of these talks [Kai04a] were circulated at that time. 
However, it took quite a while to complete the present version, and in 
the meantime some of its ideas and tools were independently developed 
by other authors. The geometry of the Schreier graphs of the group F’ 
corresponding to its canonical action on the unit interval was completely 
described by Savchuk [Sav10b, Sav15]; Mishchenko [Mis15] published a 
new proof of the absence of the Liouville property for the group F’ based 
on the analysis of Savchuk. 
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1. Preliminaries 


1.1. Random Walks on Groups 


The (right) random walk (G, 44) on a countable group G determined by a 
probability measure (the step distribution) 2 on G is the Markov chain 
on G with the transitions 


h 
gS gh, g eG, 


i.e., its transition probabilities 7, are the translates 


Tg = Gh, g eG, 


of the measure jw. Any initial distribution @ on G determines the associated 
Markov measure Pg on the space G+ of sample paths g = (go, 91,---)- 

By P = Ps, we denote the probability measure on the path space 
whose initial distribution is the point measure 5¢ concentrated on the 
identity e of the group G. For the random walk issued from the identity 
of the group (so that go = e), its position gn at time n is the product 
9n = hyho...hy of n independent identically -distributed increments hj, 
and the distribution of gy (the nth marginal distribution of the measure 
P) is the n-fold convolution of the measure jw. 
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1.2. Poisson Boundary 


The measure Py, whose initial distribution is the counting measure # 
on the group G, is invariant with respect to the time shift 7 on the path 
space. The Poisson boundary 0, G of the random walk (G, 2) is the space 
of ergodic components of the time shift T on the space (G2+, Px), and 
any initial probability distribution 6 on G determines the corresponding 
harmonic measure vg on 0,G as the image of the measure Pg < P» on 
the path space under the quotient map G2+ > du G. We emphasize that 
the Poisson boundary is defined in the measure category only. Below, 
when talking about the Poisson boundary 4, G, we shall always (unless 
otherwise specified) endow it with the measure v = vs,, which is the 
harmonic measure of the initial distribution 5¢. 

The Poisson boundary is equipped with the natural (left) action of the 
group G induced by the coordinate-wise left translations on the path 
space, and for an arbitrary initial distribution 6 


vg =O*v = > 6(g) gv. 
g 


Although the harmonic measure v need not be quasi-invariant (if the 
semigroup sgrj generated by the support supp of the measure pu is 
smaller than G), it is w-stationary with respect to this action in the 
sense that 


pwxv= >> wg) gv =v. 
g 


1.3. Poisson Formula 


A function f on G is called -harmonic if it is preserved by the Markov 
operator of the random walk (G, jz) 


Put (9) = >> wf (gh), 
h 


ie, if Pf =f. The Poisson formula 
fO=F.9), 9 €G, (11.1) 


establishes an isometric isomorphism between the space H™(G, jw) of 
bounded y-harmonic functions f on the group G and the L® space 
on the Poisson boundary 0, with respect to the quotient measure class 
determined by the measure Py. It consists in assigning to a boundary 
function - the function f on G whose value at a point g € G is the 
result of the integration of the function f against the translate gv of the 
harmonic measure v. 

If g in the Poisson formula (11.1) is only allowed to take values in 
the semigroup sgrj, then (11.1) becomes an isometric isomorphism 
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between the space of bounded y-harmonic functions on sgryw and 
L™(0,G,v). This property uniquely characterizes the Poisson boundary. 
Namely, any G-space (B,A), for which formula (11.1) is an isometric 
isomorphism between the spaces H™(segr jw, “) and L°(B,A), is isomor- 
phic to the Poisson boundary (0,G,v) in the category of measure G- 
spaces. See Kaimanovich—Vershik [KV83], Kaimanovich [Kai92, Kai00], 
Furman [Fur02], Erschler [Ers10] for more background on the Poisson 
boundary. 


1.4. Stability of the Liouville Property 


A random walk (G, 2) is called Liouville if the harmonic measure v is 
a point measure. The reason for this terminology is that in view of 
the Poisson formula (11.1) this property means that there are no non- 
constant bounded u-harmonic functions on the semigroup sgr jz. One can 
easily see that the latter property is actually equivalent to the absence 
of non-constant bounded p-harmonic functions on the whole group gr ju 
generated by supp ju as well. 

There are numerous examples showing that one can have both 
Liouville and non-Liouville random walks on the same group (e.g., see 
Kaimanovich-Vershik [KV83], Kaimanovich [Kai83], Erschler [Ers04b, 
Ers04a], Bartholdi-Erschler [BE11]), and it is still not clear how the 
Liouville property for random walks on the same group depends on the 
step distribution jw. 

The main open problem here is 


whether all finitely supported symmetric measures ju with 
gr = G on the same finitely generated group G are either 
Liouville or non-Liouville simultaneously, 


or, in a somewhat different form, 


whether the Liouville property is stable with respect to rough 
isometries. 


In the absence of group invariance, for general graphs or Rie- 
mannian manifolds, the corresponding couterexamples to the stabil- 
ity of the Liouville property were first constructed by Terry Lyons 
[Lyo87]. We believe that such counterexamples should exist in the group 
setup as well. However, the pool of groups, for which the Liouville 
property has been studied, still remains quite limited in spite of 
some significant recent progress, see Bartholdi-Virag [BV05], Brieussel 
[Bri09], Bartholdi-Kaimanovich—Nekrashevych [BKN10], Amir—Angel— 
Virdg [AAV13], Amir—Angel—Matte Bon-Virdg [AAMBV13], Matte Bon 
[MB14], Kotowski-Virdg [KV15], Juschenko—Matte Bon—Monod-de la 
Salle [JMBNdIS15]. 
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1.5. Liouville Groups 


Because of the above there are different ways of calling a group G Liouville 
depending on which class of step distributions jz on G one considers (e.g., 
all measures, just finitely supported measures, finitely supported sym- 
metric measures, etc.). According to one popular definition a group G is 
called Liouville if the random walk (G, 2) is Liouville for any (possibly 
degenerate) finitely supported symmetric probability measure jz on G, 
see Matte Bon [MB14]. Sometimes one also talks about the Liouville 
property of a group G with respect to a finite symmetric generating set 
K (in which case one takes for 4 the measure equidistributed on K), see 
Gournay [Goul4]. 


1.6. Amenability and the Liouville Property 


Liouville groups (no matter which definition one takes) are always 
amenable, which goes back to Furstenberg [Fur73]. The reason for that 
is that a random walk (G, ww) is Liouville if and only if the sequence 
of Cesaro averages of the convolution powers of ju is asymptotically 
invariant with respect to the left action of the group gry on itself, 
see Kaimanovich—Vershik [KV83]. Therefore, the Liouville property of 
the random walk (G, 2) implies amenability of the group gr in a very 
constructive way. Conversely, as it was first conjectured by Furstenberg 
in [Fur73], and proved by Rosenblatt [Ros81] and Kaimanovich-Vershik 
[KV83], any amenable group G carries a symmetric measure w with 
supp “ = G such that the random walk (G, ) is Liouville. Note that 
there are amenable groups G, for which such a measure fu cannot be 
chosen finitely supported, as it was shown by Kaimanovich [Kai83], or 
even to have finite entropy, as it was shown by Erschler [Ers04b]. 

Therefore, a possible approach to proving non-amenability of a given 
group G consists in showing that it carries no Liouville random walks 
with non-degenerate step distributions pw (i.e., such that gru = G). 
Actually, in view of the above result of Rosenblatt and Kaimanovich— 
Vershik, it is enough to consider just the measures with sgr uz = G (we 
shall call such measures strictly non-degenerate; sometimes they are also 
called adapted), or even just with supp u = G. Here we take the first step 
in this direction for Thompson’s group F' by showing that random walks 
on F with finitely supported strictly non-degenerate step distributions 
are non-Liouville. 


1.7. Lamplighter Groups 


Our approach to the Liouville property for the group F consists in using 
the transience of the induced random walk on an auxiliary homogeneous 
space I" of the group to prove that certain configurations on I: associated 
with the elements of F’ stabilize along the sample paths of the random 
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walk. It is precisely this idea that was first used for exhibiting a non- 
trivial boundary behavior for random walks on lamplighter groups, see 
Kaimanovich-Vershik [KV83] and Kaimanovich [Kai83], and, to begin 
with, we shall outline it in the lamplighter context. 


Definition 1.1. The lamplighter group £(G) over a base group G is the 
semi-direct product of the group G and the additive group fun(G, Z/2Z) 
of finitely supported Z/2Z-valued configurations on G endowed with the 
action of G by translations. 


Remark 1.2. In the context of functional and stochastic analysis these 
groups were first introduced by Vershik and the author [KV83] under 
the name of the groups of dynamical configurations. However, this term 
did not stick, and the current generally accepted standard is to call them 
lamplighter groups. In the algebraic language L(G) is the wreath product 
with the active group G and the passive group Z/2Z. Since our purpose is 
just to outline the general idea, we are not discussing here more general 
wreath products (for which see, for instance, Kaimanovich [Kai91] and 
Erschler [Ers04a, Ers06a]). 


As a set, £(G) is the usual product of G and fun(G, Z/2Z), so that 
L(G) = {(g,®):9€ G, Be fun(G, Z/2Z)}. 


However, the group operation in £(G) is ‘skewed’ by using the left action 
of G on fun(G, Z/2Z) by the group automorphisms 


TI0(h) = O(g- th) (11.2) 


induced by the left action of the group G on itself by translations, so 
that the group multiplication in £(G) is 


(91, P1)(g2, 2) = (9192, P1 + T% G9). (11.3) 


The identity of L(G) is (e, 2), where e is the identity of G, and @ (the 
empty configuration defined as (h) = 0 for any h € G) is the identity of 
fun(G, Z/2Z). 


1.8. Stabilization of Configurations 


Given a probability measure yw on the group L(G), the positions (gn, Pn) 
of the corresponding random walk at two consecutive time moments are 
related as 


(9nt1, Pn41) = (Gn, On) (hn41, Ont); 


where (hj,g;) are the increments of the random walk. Therefore, by 
formula (11.3), 


9n41 = Gnhn+1; (11 4) 
Ony1 = On + T9 Gn41. 
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In particular, the first component gn € G, i.e., the image of the random 
walk (£(G), w) under the group homomorphism 


L(G) > G, (g,®) g, 


performs the quotient random walk on G driven by the image w’ of the 
measure 2 under the homomorphism L(G) > G. 

If the quotient random walk (G, ’) on G is transient (i.e., its sample 
paths almost surely go to infinity on G), and the support of the measure 
uw is finite (which implies that there is a finite set A C G which contains 
the supports of all the increments g,,), then formula (11.4) implies that 
the configurations ®,, almost surely stabilize as n > oo. It means that 
there exists a pointwise random limit 


Oo = lim ©, = 9, +T@ go + T?g34+..., 
n—>00 


which belongs to the space Fun(G, Z/2Z) of all (not necessarily finitely 
supported) Z/2Z-valued configurations on G. 

It is easy to see that under natural conditions on the measure jz the 
limit configuration ®, cannot be the same for a.e. sample path, and 
therefore the Poisson boundary of the random walk (L(G), 4) is non- 
trivial. This is the case if the measure py is non-degenerate in L(G), but 
actually this assumption can be siginificantly weakened, see Kaimanovich 
[Kai91]. The map, which assigns to a sample path (gn, ®n) the associated 
limit configuration ®. € Fun(G, Z/2Z), is obviously £(G)-equivariant 
(with respect to the action on Fun(G,Z/2Z) determined by the ‘con- 
figuration component’ of formula (11.3)). Therefore, the configuration 
space Fun(G, Z/2Z) endowed with the resulting hitting distribution is a 
p-boundary (an equivariant quotient of the Poisson boundary). 


2. Thompson’s Group F as a Subgroup of PLF(R) 


2.1. The Group F and Its Generators 


Definition 2.1. The Thompson group F is the group of the orientation 
preserving piecewise linear self-homeomorphisms g of the closed unit 
interval [0,1] that are differentiable except for finitely many break 
points t;, which are dyadic rational numbers, and such that the slopes a; 
are integer powers of 2: 


g(t) = a;t+ 6; on [ts, tea, 
k; 
ee eZ 3|, ag = 2", b,  Z[5], 


Fone 


see Figure 11.1. 
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Figure 11.1 The graph of an element of Thompson’s group F’ 


We refer to Cannon—Floyd—Parry [CFP96] for the general background 
on the group F’. In particular, it is finitely generated with the generators 


t 1 
Bo USt=3 
= 1 1 3 
AQt)=)t-G, 5<t<f 
2-1, 2<t<1 
and 
1 
t, 0<t<} 
fat °# 3 
_peras S84 
BW) = i. 8 7? 
Lee ga vss 
2-1, g<t<1 


see Figure 11.2 (where the arrow indicates that the graph of B restricted 


to the square 5. 1] x 5, 1] is precisely the graph of A rescaled by a factor 
of 2). 


The group F is finitely presented, and it is determined by the relators 
[AB1,4-1BA],  [AB-1,A-?BA?], 
1 


where [x,y] = zyz~ty~! denotes the usual group theory commutator. 
The group F is also often described by the infinite presentation 


-1 
(10,71, %2,..-|%, In X= In+1 for all k <n), 


which can be obtained from the previous one by putting 7% = A and 
In = Al-" BA"! for n > 0. 
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A rr ee re B 
ee. ey 
a cae 
Aa ~~ 
1 1 
3 
4 
5 
1 8 
2 1 
2 
1 
4 
0 1 3 1 0 1 3°71 
2 4 2 4 8 


Figure 11.2 The standard generators of Thompson’s group F 


2.2. Homomorphisms to Z 


The abelianization Z? of the group F is freely generated by the images 
of the above generators A and B. By xq and xp we denote the 
corresponding homomorphisms of F' to Z, i.e., the homomorphisms x, 
and x» are defined by putting, respectively, 


Xa(A) = 1, Xa(B) =0, 
and 
xy(A) =0, xo(B) =1. 


Geometrically, —xa(g) (respectively, xa(g) + xp(g)) is the base 2 
logarithm of the slope of the graph of g at the endpoint 0 (respectively, 
at the endpoint 1). 


2.3. Change of Variables 


The group F can also be realized as a subgroup of the group of piecewise 
linear homeomorphisms of the real line PE F(R) introduced by Brin—Squier 
[BS85] (also see Haagerup—Picioroaga [HP11, Remark 2.5]). 

More precisely, let t : [0,1] + R be defined by putting 


1 (11.5) 
t(l—-—]J=n-l1, n>1 
Qn 


and by linear interpolation in between, see Figure 11.3. 
Then one can easily see that t is a bijection between the set of dyadic- 


rational numbers in the open unit interval Z [3 N (0,1) and the whole 
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Figure 11.3 The change of variables from [0,1] to R 


set of dyadic-rational numbers Z [3]. After this change of coordinates 
the generators A and B of the group F (see Section 2.1) take the form 


Ayy=vy-1, veR, 
and 
Y; y <0, 
Biy)=45, O<y <2, 
Their inverses are, respectively, 
Aly)=y+1, yveR, 
and 
Y > y <0, 
Bate, Oey, 
yori, yed, 


see Figure 11.4, where the graphs of the generators A and B themselves 
are drawn with solid lines, and the graphs of their inverses are drawn 
with dashed lines. 

We shall denote the image of the group F under the change of 
coordinates (11.5) by PEP ,Le., F is the subgroup of PL F(R) generated 
by the transformations A and B. 


2.4. Intrinsic Description of F 


One can easily see that F, as a subgroup of PLF(R), consists precisely 
of those transformations g € PLF(R), for which the following three 
conditions are satisfied: 


(i) The discontinuity points of the derivative g’ are all dyadic rational. 

(ii) The slopes of g are integer powers of 2. 

iii) For a sufficiently large M > 0 the transformation g has the form 
y larg g 
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Figure 11.4 The graphs of the standard generators of the group F 


gvy)=y+C_ for y €(-o,-M), 
and 
gy)=vt+C, for y€(M,oo), 
where the constants C_ = C_(g) and Cy = C+(q) are integers. 


Clearly, the maps C+ -F — Z are group homomorphisms, and for the 
generators of the group F 


C_(A) = C,(A) =-1, 


whereas 
C(B)=0, C(B)=-1, 


so that in terms of the homomorphisms xq, xp : F = F - Z introduced 
in Section 2.2 


C_-=-xXq and CrL=-—xXa-X- 


3. Random Walk on the Schreier Graph [ = Z [5] 


3.1. The Dyadic-Rational Schreier Graph 


For technical reasons (since traditionally one considers right random 
walks on groups, for which the increments are added on the right), from 
now on we shall use the postfiz notation for the group F’. Thus, the group 
operation in F' will be defined as 


(mg)(Y) = gp(ny)) ; (11.6) 


(rather than the more common (g192)(y) =_gi(g2(y)) in the prefix 
notation which we used in Section 2.1). Then F' will naturally act on R 
on the right as 
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yvo=gy), veR geF. (11.7) 


Our notation agrees with the one used when dealing with random 
walks on groupoids (in particular, on the groupoids associated with group 
actions), see Kaimanovich [Kai05]. 


The set of dyadic rational numbers Z [3] C R is obviously transitive 
for this action, so that Z [3 can be endowed with the structure of the 
Schreier graph T° of the group F with respect to the generating set 

K ={A,B, A,B}. (11.8) 


3.2. Simple Random Walk on the Schreier Graph 


The simple random walk on the graph I’ has the transitions 


ug) 
yey.g, (11.9) 


where «4 = wxK is the probability measure equidistributed on the 
generating set K, 1.e., 

ee 

~~ A(y) 

a gs 

wn AH) 
a 

~» By) 

1 

Dy A: 
~~ BMY). 


More concretely, depending on the position of the point y these 
transitions are 


1 
2 
1 
ye} YF ye 0,0)U[2,00), 
aimed 
1 
eye | 
1 
4 
ae os 
Yo Ss 4 , yelO;4], 
nds y 
1 
As Qy 
1 
Soy +1 


1 
yuma sy 1) y €[1,2]. 


i 
4 
Pte: 


2 
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We shall denote by Q the probability measure on the path space of the 
simple random walk on I’ corresponding to the starting point 0. 


Let |y|2 be the usual 2-adic norm of a number y € T = Z [3]. L.e., 
lviIg=2-" fory = (2m+1)2”, 
and |0|2 = 0. 


Theorem 3.1. For Q-a.e. sample path (yn) of the simple random walk 
(11.9) on T 


lynl2—- oO. 
n> 0o 
As a consequence we immediately obtain 
Theorem 3.2. The simple random walk on T is transient. 


Proof of Theorem 3.1. Let us first of all notice that, outside of the 
interval (0,2), the transition probabilities of the random walk (11.9) 
are just 


PY,YHED=az. 


Therefore, the interval [0,2) is recurrent by the classical properties of 
the simple random walk on Z. Moreover, starting from any point y € 
[1,2) the random walk will eventually hit the interval [0,1) (and the 
corresponding hitting distribution is supported by the points y —1 and 7 
with the equal weights 5). Thus, the interval I = [0, 1) is also recurrent. 


The transitions of the induced random walk on J are, for y € [0. 3): 


ery 
i 
2 
any: 
OY PRO 
yu manta ds 
Ae} 
4 
Maa 
Le., 
3 
8 
aaa! A 
ES 
we F 
| ied 1 , 
aes Yad 
2 
i 
4 
Oa 


NS w 
x 
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and, for y € E 1), 


i 
4 
ae 
As y tl ws 1 i 
2: y 1 
nay Xo td 
YEeN? 1 ‘ vi 
ati. 
wet 
i 
z mm 2y -—1 
me Qy ~~ 1 
Rey 
i.e., 
1 
2 
a 
1 
wef 
aod i 
8 
rbte 
neo dy — 1 


Therefore, for y € I \ {o, 3| the logarithm x = loga|y|2 of the 2-adic 


norm changes as 


3 

Pee 

3 1 
TO Sort» YE 0,5 > 

i 

mse —1 

Aly 

2 

AST 

3 1 
Sy honoed Seg tl» YE 571 5 

1 

eg —1 


i.e., these transitions on Z, have a uniform positive drift, which implies 
the claim. 


Remark 3.3. It might be interesting to study the properties of the 
induced random walk on J and of its stationary measure (which is most 
likely unique). 
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3.3. Comparison Criterion for Transience of Markov Chains 


We shall now use a comparison argument in order to deduce the 
transience of general (not necessarily reversible!) random walks on the 
Schreier graph I from the transience just of the simple random walk 
on I. Its idea goes back to Baldi-Lohoué—Peyriére [BLP77], and it has 
been quite popular ever since (e.g., see Varopoulos [Var83], Chen [Che91], 
Woess [Woe94] as well as the exposition in Woess’ book [Woe00, Sections 
2.C and 3.A]). For the sake of completeness we shall prove it here in the 
generality sufficient for our purposes by slightly modifying the arguments 
of Varopoulos from [Var83, Section 4]. 


Theorem 3.4. Let P and P’ be two Markov operators on a countable 
state space X with the respective transition probabilities p(-,-) and p'(-,-), 
and such that 


(i) P and P’ have a common stationary measure m. 
(ii) The operator P is reversible with respect to the measure m, 1.e., it is 
self-adjoint as an operator on the space L?(X,m), or, equivalently, 


m(x)p(z,y) = my)piy,z) Wa,yex. 
(iii) There exists e > 0 such that P’ > &P, we., 
pi(z,y) 2 ep(z,y) Va,yeX. 


Then the transience of the operator P implies the transience of the 
operator P’. 


Proof. First of all, let us notice that the operator 


P'—eP 


= l-e 


is Markov and preserves the measure m. Therefore, for any 0 < 4 < 1 
and any f € L?(X,m) 


ACF, Qf)| < Ides QFYI < MFI? - 
It implies that 
(I -AaP)f,f) = ellfllp, = 9, (11.10) 
where (-,-) denotes the scalar product on the space L?(X,m), 
(f,9)P.a = Uf, I — AP)g) (11.11) 


is the positive definite bilinear form on L?(X,m) determined by the 
operator J — AP, and || - ||, || - l|p,, are the respective associated norms. 
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The Cauchy—Schwarz inequality for the form (-,-) p,, then implies that 
for any f € L?2(X,m) 


(2-aPy ff)? =(i-aAP) lf, - AP) - AP) "fy? 
S(= APS Pl =2Py Pps 
<2 -aPy "fp, - IG -APy FI, - 


By (11.10), 
| -aP’)"filb, < “(dr - MPO Fis 
whereas 
| - AP) Yb, = (7 -APY YF), 
so that 
(1 — AP’) lf, fy? “(dt - AP’) "f,f) (I—-aP) lf, fy. (11.12) 


Now, if f = 6, for a point o € X, then 


1 
Gpy(0, 0) = AG AP)~*8o, 80) 


is the A-Green kernel of the operator P at the point o, and the same 
holds for the operator P’. Since A < 1, 


0<Gpy(0,0) <0, 
whence by (11.12) 
Gp13(0,0) $ ~GP.3(0, 0) < —Gp.a(0,0), 
which, by letting 4 — 1, implies the inequality 
Gp1(0,0) 5 ~GP.1(0,0) (11.13) 


Therefore, the finiteness of Gp.j(0, 0) (= the transience of P) implies 
the finiteness of Gp 1(0, 0) (= the transience of P’). 


Remark 3.5. We are not aware of any ‘elementary’ proof of inequal- 
ity (11.13). 


Remark 3.6. The form (11.11) decomposes as 
(,-)Pra=Dpt+d—A)(,-), 


where Dp is the Dirichlet form of the operator P. 
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3.4. General Random Walks 


Given a probability measure yz on G we shall denote by (I, jz) the induced 
random walk on I’ with the transitions 


ear (11.14) 


In other words, the measure Q, on the space of paths of the induced 
random walk issued from a point y € I is the image of the measure P 
on the path space of the random walk (G, jw) (see Section 1.1) under the 
map 


8 = (In) v-8 = (¥-Ggn) - (11.15) 


Theorem 3.7. Let yx be a strictly non-degenerate probability measure 
on the group F. Then the induced random walk (1, ) is transient. 


Proof. ‘This is a direct application of Theorem 3.4. By the strict non- 
degeneracy of the measure yw, for any g € F there exists n = n(g) such 
that the n-fold convolution u*"(g) is strictly positive. Let 


n=max{n(g):ge Kk}, 
and 
6 =min{u*"(g): ge K}>0, 


where K is the generating set (11.8). Then the measure 


2 k 
Ta 
k=1 


w= 


i 


has the property that 


u(gy=euk(g) VgeEKk, (11.16) 


with e =d/n > 0. 

Let us now take for P (respectively, P’) the Markov operator of the 
simple random walk (T, wx) (respectively, the operator of the random 
walk (I, ’)). If we take for m the counting measure on I’, then conditions 
(i) and (ii) of Theorem 3.4 are obviously satisfied, whereas condition (iii) 
follows from inequality (11.16). Thus, Theorem 3.2 implies transience 
of the random walk (I, uw’), and therefore of the random walk (I, 2) 
as well. 


Remark 3.8. One should be able to significantly relax the condition 
sgr 4 = F imposed on the measure yw in Theorem 3.7. We expect it to 
hold just under very mild assumptions on the group gr w. 
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4. Non-trivial Behaviour at Infinity Determined by 
Stabilizing Configurations 


4.1. Groups of Configurations on T 


Let fun(T, Z) (respectively, Fun(I, Z)) denote the additive group of finitely 
supported (respectively, of all) Z-valued configurations on the graph T, cf. 


Section 1.7. By using the right action (11.7) of the group F’ onl’ = Z [3 


we shall define the associated left action of F on the configuration space 
Fun(T, Z) as 


Soy=9.9=9Y), geF, yer. (11.17) 


4.2. Configurations Associated with the Elements of F 


Given a transformation g € F , we shall denote by C, ¢€ fun(I’,Z) the 
associated configuration on I’ defined as 


Co(yv) = logs g'(vy +0) —logg g'(v-0) = Vy eT. 


In other words, Cg(y) is the difference between the base 2 logarithms of 
the left and the right slopes of g at the point y. The support of Cg is 
precisely the set of break points of g, i.e., the set of discontinuity points 
of the derivative g’, see Figure 11.5. 

Informally we shall say that the value Cg(y) is the logarithmic increment 


of the slope of g at the point y. The ratio g(y+0) appears in the literature 
9 J(y-0) 


on Thompson’s groups (e.g., see Liousse [Lio08]) under the name of the 
jump of g at the break point y, so that in these terms C,(y) is the 
logarithm of the jump of g at y. 

Remark 4.1. Obviously, two different transformations g1, 92 € F give 
rise to the same configuration if and only if g = 91. + C for Ce Z 
(for, if the difference between two functions from F is a constant, then 
this constant must be an integer, see Section 2.4). On the other hand, 


Figure 11.5 The configuration on T = Z [3] associated with an element of F’ 


TTHOMPSON’S GROUP F' IS NoT LIOUVILLE 323 


not every configuration g € fun(I’, Z) corresponds to an element g € F. 
There are two conditions, whose combination is necessary and sufficient 
for that. The first condition is 


>> oy) =0 


yel 


(for, any g € F has slope 1 both at —oo and at +00, so that the sum of 
logarithmic increments of the slope must be 0). It guarantees that there 
exists a transformation g € PLF(R) with 


g(t) =t+ CL at — oo, 
g(t) =tT+ Cy at +o, 


where the difference C.—C_ is uniquely determined by the configuration 
gy. Now, the second condition is Cy — C_ € Z (cf. Section 2.4). For 
instance, the configuration g = —d5p9 + 6; defined as 


—l, y=0, 
gyy= jtl, y=l, 


0, othwerwise 


satisfies the first condition, but not the second one; the resulting 
transformation g € PLF(R) is (up to an additive constant) 


Y y <0, 
gVY)=45; Ova; 
Y pas 


4.3. Composition of Configurations 


By the chain rule 


(9192) (7) = 9) - 92(01(7)) 
(keep in mind that the group multiplication (11.6) in F’ is defined by 
using the ‘inverse composition’). Thus, 
Cam) =Ca(Y tony) VogeFyer, (11.18) 
or, in terms of the action (11.17), 
Cog =Co +8%Cy VorgeF. (11.19) 


Remark 4.2. Formula (11.19) looks very similar to the ‘configuration 
component’ of formula (11.3) for the multiplication in the lamplighter 
groups. Note, however, that the actions T (11.2) and S (11.17) on the 
respective configuration spaces, which appear in these formulas, are quite 
different. The action T is determined by the action of the group on itself 
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on the left, whereas the action S is determined by the action on Tr = Z [3 
on the right. In particular, as a result of this 
T5p = Soh VgheG 
in the lamplighter setup of Section 1.7, whereas for the action (11.17) 
$96, = by g-1 VgeF,yer. 


Formula (11.19) allows one to define yet another (‘skew’) left action 
of the group F' on the configuration space Fun(T, Z) as 


(g,C) > Cg +S9C. (11.20) 


This action is similar to the natural action of the lamplighter group £(G) 
on the configuration space Fun(G, Z/2Z) defined as 


(g,®)\C=O4+TIC. 


It is this action that we mentioned at the end of Section 1.8 saying 
that it is determined by the ‘configuration part’ of the multiplication 
formula in the group £(G). Note, however, that in the lamplighter case 
the component g of a group element (g, ®) € £(G) is responsible just for 
‘translating’ a configuration C, whereas the component ® is responsible 
just for adding an ‘increment’ to C. On the contrary, there is no such 
‘splitting’ in the case of the action (11.20). 


Lemma 4.3. The action (11.20) of the group F on Fun(Y,Z) has no 
fixed points. 


Proof. Let C € Fun(I,Z) be a fixed point of the action, i.e., 
C=Cg+S9C VgeF. 


Since the configuration Cz associated with the generator A of the 
group F' (see Section 2.3) is empty, the configuration C must be then 
invariant with respect to the transformation $4 determined by the action 
§ (11.17). In other words, C must be periodic on T = Z [3 with period 1. 


In the same way, by taking g = B we arrive at the condition 
C=Cz+S8%C, 
where 
CR = —S0 + 52 


by the definition of the generator B (see Section 2.3). Since B acts 
trivially on Z [3 M (—oo, 0], it implies that C cannot be 1-periodic, 


whence a contradiction. 
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4.4. Stabilization of Configurations 


Theorem 4.4. Let uw be a finitely supported strictly non-degenerate 
probability measure on the group F. Then for a.e. sample path (gn) of 
the random walk (F, LL) the configurations Cn = Cg, pointwise converge 
to a limit Z-valued configuration Coo € Fun(T, Z), and the space Fun(T, Z) 
endowed with the arising limit distribution 4 is a non-trivial .-boundary. 
Therefore, the Poisson boundary (0, F',v) itself is also non-trivial. 


Proof. By formula (11.18), for any y € T 
Cnty) = Cn (VY) + Cari (V-9n) ; 


where, as usual, (hy) is the sequence of increments of the random 
walk (gn). Since supp is finite, the supports of all configurations 
Cp, h € supp, are contained in a certain finite set A C I’. Therefore, 
by Theorem 3.7 the sequence (C;(y)) almost surely stabilizes for any 
y €T, i.e., it converges to the limit configuration 


Coo = limCn = Ch, + 87 Chy +87 Cng + -+- - 


Then the resulting measure v on the space Fun(I, Z) is u-stationary with 
respect to the action (11.20), cf. Section 1.8 for the lamplighter case. 
Now it remains to show that the limit configuration Co cannot be 
the same for a.e. sample path (gn), i.e., that there does not exist a 
configuration C € Fun(I,Z) such that almost surely Coo = C. Indeed, 
since ser = F , if this were the case, then the limit configuration C 
would have necessarily been a fixed point of the action (11.20) of the 
group F on Fun(l, Z). However, this is impossible by Lemma 4.3. 


Remark 4.5. The limit distribution 2 constructed in Theorem 4.4 is 
concentrated on the subset of Fun(I’, Z) which consists only of infinitely 
supported configurations. The reason is that its complement fun(I’, Z) is 
countable, whereas it is well known that any non-trivial -boundary is 
purely non-atomic, see Kaimanovich [Kai95]. 


5. Geometry of the Schreier Graph [ 


5.1. The Graph I 


Our proof of Theorem 3.2 on the transience of the simple random walk 
on the Schreier graph T = Z [3 of the group F was based on purely 
probabilistic considerations, and this is the argument we had referred to 
in [Kai04a]. 

Shortly thereafter Savchuk independently analyzed the geometry of 
the Schreier graph of the original Thompson group F’ (endowed with the 
standard set of generators) on the dyadic-rational orbit in the interval 
[0,1], and obtained its complete description [Sav10b] (also see [Sav15] 
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wth Hi XiNI- AWAKE OX 


Figure 11.6 The Schreier graph I of Thompson’s group F' = F onthe dyadic- 
rational orbit in R 


for the Schreier graphs on the other orbits of the canonical action of 
the group F’). Since the dyadic-rational orbit of our group F' is precisely 
the image of the dyadic-rational orbit of the original group F' under 
the coordinate change (11.5) (see Section 2.3), this Schreier graph is 
isomorphic to our graph I. 

Figure 11.6 is a somewhat modified version of the picture of the 
graph I which first appeared in [Sav10b] (I am most grateful to Dmytro 
Savchuk for sharing his graphics source files with me). 

The labelling of the vertices of [ on our Figure 11.6 corresponds to 


the action of F on lr = Z [3 rather than to the original action of the 


group F on Z [3 |no, 1) as in [Sav10b]. Following [Sav10b], we use three 
different colors for the vertices of the graph I in order to better exhibit 
its structure: black for y € Z [3] MN (—oo, 0], grey for y € Z [3 N (0,1), 


and white for y € Z [3 M [1, 00). 

The solid (respectively, dashed) arrows represent the_oriented edges 
in I’ corresponding to the generator A (respectively, B), see Section 
2.3. As is customary for Cayley and Schreier graphs, labels are assigned 
to oriented edges, so that the label of the same see eed with the 
opposite orientation is the letter of the alphabet {A, A 1 B, B- 1) inverse 
to the label of the original edge. 

The ‘open arrows’ < (respectively, <=) on Figure 11.6 represent the 
infinite rays of negative (respectively, positive) type, see Figure 11.7 and 
Figure 11.8, respectively. These rays are the connected components of 
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Figure 11.7 Negative type rays in the Schreier graph T 


y 


yb 1 y+2 y+3 


Figure 11.8 Positive type rays in the Schreier graph T° 


the subgraph of I with the vertex set (—oo, 0] N Z [3 (respectively, 
[2,00) NZ [3)) attached to the starting points from the set (0, 1]NZ [3 


(respectively, [1,2) NZ [3)). The only vertex of I’, to which two rays are 


attached (both a negative type and a positive type ones), is 1. Note that 
by the presence of these two families of rays the structure of the graph 
I is really begging for applying the coordinate change (11.5). 


5.2. The Skeleton 


The skeleton I of the Schreier graph I is the tree obtained by removing 
from I all the rays of negative and positive types, see Figure 11.9. The 
tree I is obviously roughly isometric to the usual rooted binary tree. 
Indeed, let B be the binary tree which consists just of the grey vertices 


of T (i.e., of the points from (0, 1)NZ [3)): Then I and B differ only by 


the presence of an additional edge (the one between the vertices 1 and 5) 
and of additional white vertices which subdivide some of the edges of B 
into two halves. 


5.3. Transience of T 


Since the rays removed when passing from I to [ are clearly recurrent 
for the simple random walk (we have already referred to this property 
in the proof of Theorem 3.1), the transience of the simple random walk 
on the Schreier graph is then a direct consequence of the classically 
known transience of the rooted binary tree (e.g., see Woess [Woe00]). 
This argument is self-evident, once the geometry of the Schreier graph 
I has been exhibited. Although the statement about the transience of I 
does not explicitly appear in Savchuk’s paper [Sav10b], its author was 
aware of it and mentioned it in some of his talks given at that time. 
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Figure 11.9 The skeleton I of the Schreier graph T° 


5.4. Space of Ends of T 


Let dF denote the space of ends of the graph I’. It obviously splits into 
a disjoint union 
Fr Ss Vea . (11.21) 


Here 0_T and 0,T are the infinite sets of ends determined by the rays 
of the negative and the positive types, respectively, and dT is the space 
of ends of the skeleton I. 

The sets d_T’ and 0,T can be identified with the respective subsets 
of the skeleton TP, to which the corresponding rays are attached, i.e., 
with Z [3 M (0,1] and with Z [3] M [1,2), see Section 5.1. Note that 
the topology of the union (11.21) is the same as that of the end 
compactification of the skeleton T (with the only difference that the 
point 1 ¢ Tl appears with multiplicity 2). 

In what concerns dF’, as a topological space it can be easily identified 
with the space Ae = Zo \ 2Z2 of 2-adic integers of norm 1. For doing 
that let us first notice that the downward branchings in the ‘grey’ binary 
tree B (see Section 5.2) consist in applying the maps 


Y yi 
Oe aa and [Avie Bs 3 
or, equivalently, 


1 
fiver bes with e¢€ {0,1}. 


THOMPSON’S GROUP F' IS NoT LIOUVILLE 329 


Therefore, any geodesic ray y = (71, y2,-...) in the tree B issued from 
the point yy = 5 can be encoded by the corresponding sequence ¢ = 
(€1,€2,...), where €,, are uniquely determined by the condition 


Yn+1 = fen (Yn) 


so that 
lynl2 = 2” Vn=1. 


Then the sequence 
lYnl2°Y¥n = shee gs apes a i OR 


obviously converges in the 2-adic topology to the 2-adic integer 


[o@) 
1+ > 0 en: 2” 
n=1 


of norm 1. Conversely, any 2-adic integer of norm 1 gives rise to the 
corresponding geodesic y = (y1,y2,-..) in the tree B, i.e., to an end 
of T. 


5.5. Sections of the End Bundle 
By 
Fun(l, ar) & cary? 


we shall denote the space of dI’-valued configurations on I’, or, equiva- 
lently, of sections of the end bundle 


lxoro’r, (v,o)>Y, 


over the graph I’ (cf. the definitions of the hyperbolic boundary bundles 
and of the Poisson bundles in author’s papers [Kai04b] and [Kai05], 
respectively). One can also consider the configurations from Fun(I, dP) 
as end fields on I. In the same way as the space Fun(I, Z) of Z-valued 
configurations on I’, the space Fun(I’, dP) is endowed with the left action 
(11.17) of the group F’. 


5.6. Partial Isometries of T 


Generally speaking, groups do not act on their Schreier graphs by 
graph automorphisms (here and below when talking about graph 
automorphisms or isomorphisms we mean the maps which preserve the 
Schreier labelling of edges). In particular, in our case the graph I and 


its skeleton I’ are both rigid, i.e., their groups of automorphisms are 
trivial. However, in spite of that the graph I still has certain symmetry 
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properties sufficient for our purposes. Namely, it has a rather rich set of 
partial isomorphisms, i.e., of graph isomorphisms between its subgraphs. 

First of all, obviously, all the rays of negative (respectively, of positive) 
type are pairwise isomorphic. Further, for a tree T and any two vertices 
0 #0’ € T let To_,o' denote the shadow of the vertex o’ as seen from 
the vertex 0, i.e., the subtree of T’ which consists of all vertices x € T 
such that o’ lies on the geodesic [o, x]. Then the skeleton I is self-similar 
in the sense that the shadows T',_, 1 and T'j_,, are isomorphic for any 


yeZ [3 N(0, 1) (i.e., for any ‘grey’ vertex y € T’). If now T'y_,, denotes 
the subgraph of I’ obtained by reattaching all the negative and positive 
type rays to the vertices of the shadow I'j_., C I’, then the subgraphs 


ly, CT are all pairwise isomorphic for y € Z [3 M (0,1). 


6. Non-trivial Behaviour at Infinity Determined by 
the Boundary of the Schreier Graph 


6.1. Mishchenko’s Work 


In a recent preprint [Mis15] Mishchenko gives yet another proof of the 
transience of the Schreier graph I’ different from the proofs described in 
Section 3.2 and Section 5.3 (it is based on an explicit estimate of the 
Dirichlet norm on I’). He further notices that due to a specific geometry 
of I this transience implies existence of a non-trivial behaviour at infinity 
(= non-triviality of the Poisson boundary) for the simple random walk 
on I’, and therefore for the simple random walk on Thompson’s group F 
as well (cf. the discussion of the relationship between the simple random 
walks on F and on F & F in Section 3.2). Thus, the Poisson boundary 
of the simple random walk on Thompson’s group F is non-trivial [Mis15, 
Theorem 2.5], which provides another proof of our result on the absence 
of the Liouville property for the group F’. 

The aforementioned geometric argument in [Mis15] is based on the 
following observation (Theorem 2.3), which we shall quote here in a 
slightly modified form: 


If a tree T has the property that there exists a vertex o € T of degree 
at least 2, and such that for any neighbour o’ of o the shadow T_; 0! 
is transient, then the Poisson boundary of the simple random walk on 
T is non-trivial. 


In fact, it is classically known (e.g., see Woess [Woe00]) that the 
transience of the simple random walk on a connected graph implies 
convergence of its sample paths to an end of this graph. Moreover, the 
same is true for any bounded range random walk, and, as we have already 
established in Theorem 3.7, the random walk on I’ determined by any 
strictly non-degenerate probability measure yz on F is transient. 


THOMPSON’S GROUP F' IS NoT LIOUVILLE 331 


Yet another point is that the boundary behaviour provided by the 
above argument from [Mis15] depends on the starting point y € I’ of 
the induced random walk, so that the resulting quotient of the Poisson 
boundary is not a u-boundary. Actually, the quotient of the Poisson 
boundary produced by [Mis15, Theorem 2.3] is finite (it can be identified 
with the set of neighbours of the vertex o), so that it cannot be a 
#-boundary already for this reason (for, as we have already mentioned in 
Remark 4.5, any non-trivial u-boundary is purely non-atomic [Kai95]). 


6.2. Non-trivial ~-Boundary 


Theorem 6.1. Let wu be a strictly non-degenerate finitely supported 
probability measure on the group F. Then for a.e. sample path g = (gn) 


of the random walk (F, 1) and any pointy ET =Z [3] the sequence 


Y-9n converges to an end wy = w,(g) € OI, which gives rise to a map 
gt> (@y)yer € Fun(l, od.) 


from the path space (FZ+ ,P) of the random walk (F, 1) to the config- 
uration space Fun(l,0I), and the space Fun(I’,0I) endowed with the 
arising probability measure 24 (the image of the measure P on the path 
space under the above map) is a non-trivial 4-boundary. 


Proof. As we have already noticed, Theorem 3.7 in combination with 
the fact that the support of yw is finite implies the convergence 


V-In > @, € OT for all y € T and ae. sample path (gp) . 


Thus, it only remains to show that the distribution of the configurations 
(@y)yer is not a point measure. 

First of all let us notice that if the configuration (@,) were the same 
for a.e. sample path (gp), then it would necessarily have been constant 
(i.e., taking the same value at all points of I), because the group F' acts 


on Fun(l, oP) by ‘changing’ the lr = Z [3 variable, and sgr uw = F. 

Now let us assume that @, = € dl. It means that for any starting 
point y € T almost every sample path of the random walk (T, w) (11.14) 
converges to w. Let us denote by Qy the measure on the space of sample 
paths of this random walk whose initial distribution is the counting 
measure # on TI’. 

If w € d0_T UdxI is the end corresponding to a ray of the negative 
or of the positive type, then there is a Qy-non-negligible set of sample 
paths which are entirely contained in this ray. However, since all the rays 
of the same type are isomorphic (see Section 5.6), the same would be 
true for any other ray of the same type, which means that with positive 
probability the limit end would be different from w. 

If w € dF is askeleton end, then, in the same way, the set of the sample 
paths entirely contained in the subgraph ee is Q-non-negligible. 
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In view of the self-similarity of the graph I’ (see Section 5.6) it implies 
that the same would be true for any subgraph Iy_,,,. However, one can 
obviously choose y in such a way that w ¢ dFy,,. 


The construction of a u-boundary of the group F on the configuration 
space Fun(I,Z) from Theorem 4.4 heavily used the specifics of this 
group. On the contrary, the above construction of a z-boundary on the 
space of sections Fun(I’, dP) of the end bundle over the Schreier graph 
is much more general. 


Theorem 6.2. If a Schreier graph T of a finitely generated group G 
is transient (with respect to the simple random walk on it), then for 
any strictly non-degenerate finitely supported probability measure on 
G either 


(i) there exists an end w € OY such that for any y €T a.e. sample path 
of the induced random walk (1, 2) issued from y converges to w, 


or 


(ii) the space of sections Fun(T, dV) of the end bundle over T is a non- 
trivial u-boundary, so that the Poisson boundary of the random walk 
(G, ) is also non-trivial. 


Proof. Inthe same way as in the proof of Theorem 3.7, the transience of 
the simple random walk on I’ implies the transience of the induced ran- 
dom walk (IT, 2) for any strictly non-degenerate probability measure wu 
on G. Further, if w is finitely supported, then a.e. sample path of the 
induced random walk on I converges to a random end w € dF. Let us 
denote by {x,},er the family of the corresponding hitting distributions 
on the space of ends dl. By the strict non-degeneracy of mw, all the 
measures x, are pairwise equivalent. Now, in case (i) all measures x, 
coincide with the delta-measure 46,. Otherwise, in case (ii), all measures 
%, are not delta-measures, so that Fun(I’, dP) endowed with the arising 
limit distribution is a non-trivial u-boundary in the same way as in 
Theorem 6.1. 


6.3. Convergence to Components of the End Space 


The key ingredient of our proof of Theorem 6.1 was the observation 
that for a finitely supported measure w on F' the transience of the 
random walk (I, w) (11.14) implies the convergence of a.e. sample path 
to a random end of the graph T’ (also see Theorem 6.2). By using the 
homomorphisms 


Xa Xp: F = FOZ 
introduced in Section 2.2 one can easily describe the components of the 


decomposition (11.21) of the space of ends dF, on which the hitting 
measures x, of the random walk (I’, z) are actually concentrated. 
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Since the measure yu is finitely supported, the restriction of the random 
walk (I, 4) to any of the negative (respectively, positive) type rays (see 
Section 5.1) coincides, outside of a finite neighbourhood of ray’s origin, 
just with the usual translation invariant random walk on Z. Here and 
below we identify the negative (respectively, positive) type rays in [ 
with the negative (respectively, positive) integer ray Z_ = {..., -—2, —1, 0} 
(respectively, Z4 = {0,1,2,...}) of Z. As it follows from the description 
of the graph I (see Figure 11.6), the step distribution of the induced ran- 
dom walk on Z_ (respectively, on Z+) is (—xXa)(u) (respectively, (—xa — 
Xv) ()), where by x (i) we denote the image of the measure jz under a 
group homomorphism x : F > Z. Therefore, the drift of the induced 
random walk on Z_ (= on the negative type rays) is the barycentre 


(-xa)(u) = —-a , 


and the drift of the induced random walk on Z+ (= on the positive type 
rays) is the barycentre 


(-Xa — Xp) (u) = —a—B, 
where 
a=Xa(u) and B= xp(K) 


denote the barycentres (the expectations) of the measures xq(w) and 
Xp(u) on Z, respectively. 

It is classically known that the recurrence properties of the random 
walk on Z governed by a finitely supported step distribution o are 
completely determined by the barycentre o. In particular, the ray Z_ 
(respectively, Z,) is transient (i.e., the random walk escapes to infinity 
along this ray) if and only if the drift & is negative (respectively, positive). 
Therefore, we arrive at the following description of the components 
of the decomposition (11.21), on which the hitting measures x, are 
concentrated, in terms of the parameters a, 8 € R (also see Figure 11.10): 


Proposition 11.22. Under the conditions of Theorem 6.1 


(i) Ifa > 0 anda+f8 < 0, then both the negative type and the 
positive type rays are transient, and the hitting measures x, are 
concentrated on the union d_T Ud,T. 

(it) Ifa > 0 anda+ pf = 0, then only the negative type rays are 
transient, and the hitting measures x, are concentrated on 0_T. 

(iui) Ifa < 0 anda+f8 < 0, then only the positive type rays are 
transient, and the hitting measures x, are concentrated on 0,1. 

(iv) If a < 0 anda+B > 0, then neither the negative nor the 
positive type rays are transient, and the hitting measures xy are 
concentrated on dr. 


Proposition 11.22 allows us to make more precise the description of 
the u-boundary of the random walk (F,, 2) obtained in Theorem 6.1. 
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aTua,r 


Figure 11.10 The domains in the parameter plane (@,f) corresponding to 
different supports of the hitting measures x, 


Theorem 6.3. Under conditions of Theorem 6.1 the arising probability 
measure ». is concentrated on 


Fun(’,d_[U0,T), itfa>0, anda+f <0, 


Fun(T, 0_T) , ifa>0, anda+B>0, 
Fun(T, 0,1) , ifa<0, anda+B <0, 
Fun(l, aP) , ifa<0, anda+fB>0. 


7. Concluding Remarks 


7.1. Relaxing Conditions on the Step Distribution 


Throughout the chapter we have used the assumption that the measure wu 
on the group F = Fis finitely supported and strictly non-degenerate. It 
would be interesting to see, to what extent our results can be carried over 
to more general measures. [Let us remind once again that amenability 
of the group F' is equivalent to existence of a Liouville symmetric 
probability measure w with supp u = F'| 

Here one can try to relax both the conditions on how much the measure 
ju is allowed to spread out: 


finite support 
1 


finite first moment 


4 
finite entropy 


Y 


general measures 
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and the conditions on how non-degenerate the measure yz is allowed to be: 


ssru=F 
v 

eru=F 
v 


gr pl is a non-elementary subgroup of PF. 


The meaning of ‘non-elementary’ of course needs to be specified in our 
context (for instance, cf. Kaimanovich—Masur [KM96] for the subgroups 
of the mapping class group, or Kaimanovich [Kai00] for the subgroups 
of groups with hyperbolic properties). 

As for the conditions on how much the measure mw is allowed to 
spread out, one can expect that an extension to the class of measures 
with a finite first moment should be quite feasible (cf. the case of the 
lamplighter groups considered by the author [Kai91] and the case of 
hyperbolic graphs considered by Kaimanovich—Woess [KW92]). Notice, 
however, that, once the finite first moment condition is dropped, the 
‘hyperbolic’ and the ‘lamplighter’ models drastically diverge. Boundary 
convergence in hyperbolic spaces does not require any moment condi- 
tions, see Kaimanovich [Kai00], Maher—Tiozzo [MT 14], whereas for the 
lamplighter groups, due to their amenability, the situation is completely 
different. 

For instance, there are examples of measures with infinite first 
moment on the lamplighter groups, for which the configurations do not 
stabilize, but the Poisson boundary is still non-trivial, see Kaimanovich 
[Kai83, Kai91]. In fact, Erschler proved in [Ers04b] that the Poisson 
boundary on the lamplighter groups is non-trivial for all step dis- 
tributions with finite entropy provided the quotient random walk is 
transient. She also described another family of groups (which she calls 
groups of Baumslag type) with this property. These groups are quite 
similar to the lamplighter groups, but the difference is that for the 
Baumslag type groups certain infinitely supported configurations are also 
allowed. 

Outside of the class of measures with finite entropy the situation 
becomes even more complex. There are examples of measures fu on 
the lamplighter groups, for which the Poisson boundary is trivial in 
spite of non-triviality of the Poisson boundary for the reflected measure 
iu(g) = w(g-'), see Kaimanovich [Kai83]. This is impossible for the 
measures with finite entropy because of the entropy criterion of triviality 
of the Poisson boundary, as in the finite entropy case the asymptotic 
entropies of the original and of the reflected random walks are obviously 
the same. 
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7.2. Other Subgroups of PLF(R) 


It would be interesting to understand, to what extent our results can 
be carried over to other subgroups of PLF(R), for instance, to the 
group generated by the translation y % y +1 and the transformation 
defined as 


Note that by a result of Brin—-Squier [BS85] PLF(R) (and therefore all 
its subgroups as well) does not contain non-abelian free subgroups. 

As it has been pointed out by the referee, our ‘drifting’ technique 
used in the proof of Theorem 3.1 should be applicable to the ‘F-series’ 
of the so-called Thompson-—Stein groups (they are a generalization of 
Thompson’s groups introduced by Stein [Ste92]). In particular, it almost 
verbatim carries over to the groups F'(p) introduced by Burillo—Cleary— 
Stein [BCS01] (for these groups the break points are allowed to be in 
Zs] for an integer p > 1). An interesting feature of the family {F'(p)} 
is that all these groups embed one into the other, so that they are all 
either amenable or non-amenable simultaneously. 

We are not aware of any explicit description of the Schreier graphs of 
the canonical action of any of the Thompson-Stein groups other than 
Savchuk’s description of the Schreier graphs for the original Thompson’s 
group F [Savl0b, Sav15}. 


7.3. The Lattice of “-Boundaries and the Problem 
of Identification of the Poisson Boundary 


Although the j-boundaries constructed in Theorem 4.4 and Theorem 
6.1 seem to be quite different, currently we do not have any information 
about their mutual position in the lattice of all u-boundaries. In 
particular, a priori it is not excluded that both these u-boundaries might 
in fact coincide with the full Poisson boundary (which is the maximal 
element in the lattice of w-boundaries). If not, what are, for instance, 
their infimum and supremum? Can one be just a quotient of the other 
one? And, of course, how are they related to the full Poisson boundary? 
Will their supremum, one of them, or maybe even both be the Poisson 
boundary? See author’s papers [Kai96, Kai00] for a discussion of the 
general identification problem for the Poisson boundary. 


7.4. Configuration Spaces as “¢-Boundaries, and Generating 
Partitions of the Poisson Boundary. 


The p-boundary from Theorem 6.1 seems to be the first example when a 
boundary behavior is realized on the space of configurations on a single 
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group orbit in such a way that the group action consists of just permuting 
the configurations without changing their point values. 

There is a well-known construction in ergodic theory that allows 
one to realize general group actions as actions on configuration spaces 
by translations (e.g., see Cornfeld—Fomin-Sinai [CFS82]). Namely, let 
(X,m) be a Lebesgue measure space endowed with a measure class 
preserving action of a countable group G. Given an at most countable 
measurable partition €, let X; be the corresponding countable quotient 
space (i.e., the space of the elements of the partition €), and let z > &(z) 
be the corresponding quotient map which assigns to any point z € X 
the element of the partition € which contains xz. The function 


,:G>X%, gr ég's), 


can be then considered as an X¢-valued configuration on G, and the 
map zt  ®, is obviously G-equivariant if one endows the space of 
configurations on G with the action (11.2). If the map t + ®, separates 
(mod 0) the points of X, i.e., if this map is an isomorphism between the 
space (X,m) and the space of Xz-valued configurations on G, then the 
partition € is called generating. 

Generating partitions for Z-actions are a classical tool of ergodic 
theory. Moving to more general actions, it is, for example, easy to see that 
if one takes the partition of the boundary of a free group into the cylinder 
sets determined by the first letter of the infinite words representing 
boundary points, then this partition is generating even in the topological 
category. Therefore, this example provides a symbolic realization of the 
action of the free group on its Poisson boundary whenever the Poisson 
boundary can be identified with the space of infinite words (for instance, 
for the step distributions with a finite first moment, see Kaimanovich 
[Kai00]). However, we are not aware of any work on generating partitions 
for Poisson boundaries or their quotients in general. 


7.5. Poisson Boundary and Sections of Boundary Bundles 
over Schreier Graphs 


The statement of Theorem 6.2 on the construction of a non-trivial 
u-boundary for a group G from the space of ends of its transient Schreier 
graph I is, of course, very well known when one takes for T' just the group 
G (in which case the action of the group G on itself extends to the space 
dG, and the limit configurations are equivariant as functions from G 
to 0G). However, it would be interesting to have other examples in the 
situation when the stabilizers G,, y € I’, are sufficiently far from being 
normal. It is natural to impose the condition that the Schreier graph is 
determined by a core-free subgroup H C G (ie., that the action of G 
on the homogeneous space T = H\G is faithful), as otherwise one can 
always pass to the quotient of G by the normal core of H. Note that for 
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Thompson’s group F = F the latter condition is obviously satisfied as 
the canonical action of Thompson’s group is completely determined by 
its restriction to the dyadic-rational orbit (cf. Section 2.3). 

Instead of the end bundle one can use the Poisson bundle over the graph 
TP, ie., the map I x 0,0 — I’, where 4,7 denotes the Poisson boundary 
of the induced random walk ([,) on the graph I’. The space of its 
sections is the space of configurations Fun(l’, 0,1). Let bnd = bnd(r,) 
denote the boundary map from the path space of the random walk (I, 1) 
to the Poisson boundary 0,1. If dP is non-trivial, then the projection 


g = (gn) > {bnd(y.g)}yer ; 


where g +> y.g is the map (11.15) from the path space of the random 
walk (G, 2) to the path space of the induced random walk (IT, 2), makes 
Fun(l’, 0,0) a non-trivial -boundary. It would be interesting to find 
conditions under which this ~-boundary coincides with the full Poisson 
boundary of the random walk (G, j2). 
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According to our Theorem 6.1 the sample paths of the induced random 
walk on the canonical Schreier graph I’ converge to a limit random end 
for any non-degenerate finitely supported step distribution on Thomp- 
son’s group F’. However, Kate Juschenko and Tianyi Zheng have recently 
proved (“Infinitely supported Liouville measures of Schreier graphs”, 
arXiv:1608.03554) that this may fail if the finite support condition is 
dropped, namely, that there are infinitely supported measures yz on F’, for 
which the Poisson boundary of the induced random walk on IT’ is trivial. 
Their argument is based on the observation that the strong transitivity 
of the action of F on T implies a certain approximation property which 
can then be used for constructing the desired measure w in the same 
way as Reiter’s property was used for constructing a Liouville random 
walk on any amenable group by Kaimanovich-Vershik [KV83]. We shall 
return to the relationship between amenability and the approximation 
property of the action of F on I used by Juschenko—Zheng elsewhere. 
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This is a presentation of the subadditive ergodic theorem. A proof is 
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theorem. 
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1. Introduction 


Between the years 1938 and 1948 F. Riesz published works in ergodic 
theory [R] and found among other things several extensions of the mean 
ergodic theorem by elegantly simple and powerful arguments. In October 
1931, von Neumann had proved the basic version of the mean ergodic 
theorem [vN32] inspired by Koopman’s observations published in a note 
in June that same year [Ko31]. In his 1944 lecture notes [R44], Riesz 
gave an insightful exposition of these developments. In particular, he 
presented there a new proof of the maximal ergodic lemma needed for 
Birkhoff’s a.e. ergodic theorem, which is a deeper result than the mean 
ergodic theorem and dates November 1931 [B31]. This became one of 
the standard proofs of this theorem and was based on a lemma which 
is harder to formulate than to prove (see Lemma 3.2 below). He also 
remarked that his work on the mean ergodic theorem was suggested by 
the method of Carleman [C32]. In fact, Carleman announced in May 
1931 results similar to Koopman’s as well as a proof of the mean ergodic 


1 He was supposed to deliver these lectures in Geneva in the spring of 1944, but he 
was prevented from coming. 
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theorem. This announcement was published in June 1931 [C31] and the 
details can be found in [C32].? 

The main purpose of this note is to show how Riesz’s method extends 
to give a proof of the subadditive ergodic theorem of Kingman from 1968 
[K68]. This was a by-product of the work [KM99] and all the details of 
the proof are included. Note that this proof at the same time gives a 
proof of Birkhoff’s theorem. 

Let throughout this chapter (X,) be a measure space with w(X) = 
land JT : X — X a measure preserving transformation. Recall the 
following result: 


Theorem (Birkhoff, 1931). Let f € L'(X), then there is an integrable, 
a.e. T-invariant function f such that 


n—-1 
1 = 
jim, = Vie n= s@) 
k=0 


for a.e. x (the convergence also takes place in L+). In fact lf, < Wlfl 
and for any T-invariant set A, fy f = vee in particular [y f = ied: 
Note that if 


n-1 
c(n,c) =) f(T*a) (12.1) 
k=0 


then 
c(n+tm,xr) = c(n,x)+ c(m, T” 2), 


and we then call c an additive cocycle. These are all of the form (12.1), 
for f(z) = c(1, 2). 

If for a sequence of functions a(n,.) € L'(X), with integer n > 0 and 
a(0,z) = 0, we instead require 


a(n + m,2x) < a(n,z) + a(m, T”2), 


then a is here called a subadditive cocycle. Assume that 
1 
int— [ a(n, x)du(x) > —oo. 
NIX 


Then the following generalization of the Birkhoff ergodic theorem holds. 
Theorem (Kingman, 1968). Under the above conditions, there is an 
integrable, a.e. T-invariant function a such that 


1 
lim —a(n, 2) = a(x) 
no n 


2 Several early publications in the subject give due credit to Carleman, for example, 
E. Hopf’s important monograph Ergodentheorie from 1938. In most modern works, 
however, including the standard reference [Kr85], Carleman is not mentioned. 
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for a.e. x (the convergence also takes place in L'). Moreover 


lim ~{ a(n, )du(e) = f a(x) d(x) 
A A 


no n 
for all T-invariant measurable sets A. 


A draft of this chapter was written in 1998 when I was a graduate 
student at Yale University. Some people have since told me that my text 
has been useful to them, so I thought that it might perhaps be worthwhile 
to publish a revised version. In addition, I thank the referee for useful 
comments leading to an improved text. I dedicate it to Wolfgang Woess, 
whom I have had the pleasure of knowing for more than a decade. I 
have at various times benefited from his many insights within, as well as 
outside of, mathematics. 


2. A Few Examples of Subadditive Cocycles 


We give a few examples of subadditive cocycles to illustrate that 
Kingman’s theorem is a significant extension of Birkhoff’s theorem with 
many applications. In fact the origin of Kingman’s theorem comes from 
probability theory (theory of percolation) in the works by Hammersley 
and Welsh. 


2.1. Random Products in a Banach Algebra 


Let A: X > B be a measurable map into a Banach algebra. Let 
u(n, x) = A(T”-12) A(T"~22)... A(z), 
then 
a(n, £) = log |lu(n, x)|| 
is a subadditive cocycle, because || AB|| < || All || Bl]. The corresponding 
convergence was first proved by Furstenberg and Kesten in 1960 for 
random products of matrices, of course without the use of the subadditive 


ergodic theorem. This application is used in some proofs of Oseledets’s 
multiplicative ergodic theorem. 


2.2. Random Walks 


Let G be a topological group and h: X — G a Borel measurable map. 
Let v(n, 2) = A(T"! 2)... A(Tx)h(2), if {ho TR are independent, then 
this is usually called a random walk. The range, that is, how many points 
visited in G, 

a(n, x) := Card{v(i,2):1<1i< n} 


is a subadditive cocycle. 
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Assume that d is a left invariant metric on G, (e.g. a word metric in 
the case G is finitely generated) then the drift 


b(n, x) := d(e, v(n, x)) 


is a subadditive cocycle, by the triangle inequality and the invariance of 
d. A third source of subadditivity is the entropy of the measures of the 
distribution of random walk after n steps, see [D80]. 


2.3. Metric Theory of Continued Fractions 


See [Ba97]. Let X = [0,1) and A be the Borel o-algebra. For any xz write 
it as 
1 1 


Continuing this scheme gives the continued fraction expansion of z, 


4 


a(2) := [2] , etc. The relevant transformation of X is 


oly 


unless z = 0 in which case Tx = 0. So an (x) = a(T"! 2). 
The corresponding approximants pp(x)/qn(x) are defined for all n if 
x is irrational, they are given by recusion formulas, and by 


Dn (2) _ 1 


Qn(2) mand: 


1 
an (2) , 
There is a unique T-invariant probability measure absolutely continuous 
with respect to Lebesgue measure, namely 


1 1 
v(A) = ——~ 
log2 Ja l+a2 


az(x) + 


dx 


called the Gauss measure. 
By use of the subadditive ergodic theorem, Barbolosi proves that for 
all constants C' > 0, 


1 
lim ~Cavd {1 Sin: |rqi(x) — pi(x)| < 


n>o n 


qi(2) 


exists for a.e. x. The functions are not simply a subadditive cocycle; 
it is necessary to divide into odd and even index, due to the fact that 
approximants lie on alternating sides of x. 
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2.4. Further Examples 


There are too many applications of the subadditive ergodic theorem to 
list here; let us just refer to [H72] and [D80]. 


3. A Proof of Kingman’s Theorem 


3.1. Three Elementary Observations 


Proposition 3.1. Let (vn)n>1 be a subadditive sequence of real 
numbers, that 1s Unt+m < Un + Um. Then the following limit exists 


1 1 
lim —vp = nee —Um € RU {—oo}. 
NnVwo n m>0m 
Proof. Given ¢ > 0, pick M such that vy/M < infoun/n +e. 
Decompose n = kn M + rn, where 0 < rn < M. Hence kp/n > 1/M. 
Using the subadditivity and considering n enough (n > N(e)) 
1 1 1 


inf —Um < —Un = Uk, Mtr, (nom Ur, ) 
m n n 


i 1 
< vy te < inf —vp, 4+ 2e. 
7M <in "Om 


Since € is at our disposal, the lemma is proved. 


Note that this proposition implies that for any T-invariant set A, 


lim ~{ a(n, c)du(x) = int — f a(n, x)du(x), 
noon JA nna 
when X = A denote this value by y(a). 

F. Riesz noted in [R44] that the following simple lemma can be used 
to prove Birkhoff’s theorem. The lemma is sometimes called Riesz’s 
combinatorial lemma or the lemma about leaders. 


Lemma 3.2. Call the term cy a leader in the finite sequence 
CO, C1,---;Cn—1 Uf one of the sums 


Cu, Cu + Cutls-++) Cub o0+  Cn-1 


is negative. Then the sum of the leaders is nonpositive. (An empty sum 
is by convention 0). 


Proof. Proof by induction. If n = 1, then either co > 0, in which case 
the sum is empty, or cg < 0, in which case the sum equals co < 0. 

Assume that the statement is true for integers smaller than n. Consider 
the two cases, co is or is not a leader. If co is not a leader then all leaders 
are among Cj,...,Cn—1 in which case the induction hypothesis applies. 

If cp is a leader, then pick the smallest integer k such that cgo+---+c, < 
0, then each c;, 1 < k is a leader. If not then cj +---+ cy > 0, but by 
minimality co +---+ cj-1 => 0, which is a contradiction. 
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Hence co,..., ck are all leaders and co +---+ cy < 0, the remaining 
leaders (if any) are leaders of cx,...,Cn, for which the induction 
hypothesis applies. 


Proposition 3.3. Let a(n,x) be a subadditive cocycle as in the 
introduction. Then the functions 


1 
f(z) = limsup — a(n, x) 
n>oo 1 
and 
Shed 
g(x) = liminf —a(n, 2) 
no n 
are a.e. T-invariant. 
Proof. Note that f(Tx) => f(x) and g(Tx) > g(x) because of the 
subadditivity 
a(n, Tx) > a(n+1, 2) — a(1, 2) 
and in the case of limsup (same for liminf) 
; 1 1 
lim sup (| —a(n+ 1,2) — —a(1,z)}) = f(a). 
n n 
Now integrate 


iE [f(Tx) — f@)] du(a) =0 


by the J-invariance, but the integrand is nonnegative, hence f(T) — 
f(z) =0 ae. 


3.2. The Maximal Ergodic Inequality 


The following key lemma will be proved by an extension of the argument 
of F. Riesz. It thus avoids use of the usual maximal ergodic inequality 
and it is not more difficult than Derriennic’s proof. 


Lemma 3.4 (Derriennic, 1975). Let a(n,x) be a subadditive ergodic 
cocycle as in the introduction. Let 


1 
B= {« : liminf —a(n, x) < of 
nrewo n 
then 


n>o n 


1 
lim ~{ a(n,z)du <0. 
B 
Proof. For each n, let 


Vn = {« : inf a(n,x)—a(n—k, T*z) < of F 
l<k<n 
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Note that 
An = {2 : pant ae) < of CW 

by subadditivity. Note also that An C An41 and BCU Ap. 

For each n, let 

bn(x) = a(n, x) — a(n — 1, Tr). 

Because of telescoping we have that 

a(n, x) — a(n —k, T*x) = bn (a) + bn—1(Tx) +--+ + bn gyi (T* 2) 
and in particular, recall that a(0, x) = 0, 


nals °S> bn_E(T* a). 


O0<k<n-1 


By definition, T*x € Vy_, means that there is a j, k<j<n-—1such 
that 


by—n(T*a) +++: + bn_7(Ti 2) < 0. 
Hence by the lemma about leaders applied to cy = byn_4(T*x) we have 
x bn_p(T*x) < 0. 
O<k<n-1,Tkrewv,,_p 
Therefore, using the T-invariance of uw and B, 


02 De bn—K(T*a)du(a) = 


O0<k<n-1,Terew,_p 


Se) bya (TP) dita) = 
BOT-*kw 


O<k<n-1 nok 


= if bya a) q(x) = 
Tk Bow 


=f bilerduca. 
Bnwi 


t=1 
On the other hand, again by the T-invariance 


1 Le 
2 _z)du(r) = — b;(x)du(a) = 
: 18 a(n, 2)du(z) 2 yi (2)du(a) 
donee ee 
= —5 o bj (x) d(x) + — 5 i bi(a)du(x) < 
Ma? BOW n J JB-(BM;) 


1 n 
<0+- yf at (1, 2)du(n) 
n pal B-(BnAj) 
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since bj(x) < a(1,z) < at(1,z) := max{0,a(1,n)}, which is positive 


and A; C W,;. Now since at € L!, Aj C Ajy1 and BC Uss1 Aj it 
follows when taking limsup, that 


1 
lim sup ~[ a(n,xz)du <0. 
NJIB 


Since B is invariant, the limsup actually is the limit, by Proposition 3.1. 


Proposition 3.5. Let a(n, x) be a subadditive ergodic cocycle as in the 
introduction. Let 


1 
B= {« : liminf —a(n, x) < if 
n 


then 


d. 
lim -[ a(n, rz) du < Ap(B). 
B 


n>o n 


Proof. Apply the lemma to a(n, x)— nd, which is a subadditive cocycle. 


3.3. The Proof of a.e. Convergence 


First we establish the result for additive cocycles cp, = c(n, x). The point 
is that —cy is again additive, hence in particular the previous proposition 
applies to —cp as well. 

Let 


1 1 
Fa,p = {# : liminf—cn <a < B < limsup =e} 
n n 
and by Proposition 3.3 this set is T-invariant. Hence we can apply 


Proposition 3.5 with X = Egg. If we let EF := {2 : lim inf +c, < a}, 
then E'M Bag = Ea,g, and this gives 


1 
i. cjdu = lim ~| Crd < ap(Ha,g)- 
Eo3 no n Ev,p 


And similarly for —cp, 


n >on 


. 1 
-| cd = lim ~[ —Cn dp < —Bu(Ea,g)- 
Eup Ea,p 


This yields a contradiction unless 4(Ly,g) = 0, because 


bu Ey.g) < a erde < ap(Ey.g) 
al, B 


but B >a. 
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Now let an(x) = a(n,xz), be a subadditive cocycle satisfying the 
integrability assumptions in the introduction. Consider 


n-1 
Un(t) = an(z) — D> ay (T*a). 
1=0 
Note that un is a subadditive cocycle and un < 0. Let g(x) = 
lim inf + up (2) and f(x) = limsup + un (2). For an arbitrary a > 0, we 


want to show that B := {x : f(x) — g(x) > a} has measure zero. By the 
additive cocycle case above we know that pe, a,(T*x) converges 


a.e., so taken together this would show that + an(2) converges a.e. as 
desired. 
Fix e > 0. Pick M large enough so that for all m > M, 


1 
~ [im sy) te. 
mJx 
Let 
M (9) = liminf 1 
g” (%) = Timin a nM () 
and 


; 1 
{M (2) := limsup au nM (2). 
no 


On the one hand, because nM is a subsequence of n, we have that 
gM (cz) > g(x) and f@(z) < f(x). On the other hand, by the sub- 
additivity and nonpositivity of v,, for allO<k < M, 


M+k 
POETS aye qe (0) 


Unt) M(L) < Une (L) + vy_K( 
and 
UnM+k(£) < Unm (2) + v4 (T™ 2) < tn (2). 


Therefore in view of that (n+1)nM/(nM+k) > land nM/(nM+k) > 
1, we can conclude that gV (2) = g(x) and f(a) = f(a). Let 


n-1 
uM (2) = nm (2) — a um (T™ 2), 
1=0 


which again is subadditive and nonpositive. We then have that: 


1 
f—-g=f" ee lim inf Tae 


9 
nwo Nn 


because uM < 0 and the established convergence for additive cocycles. 
This means that 


1 
B:={x:f-g>a}c {« : lim inf — v(x) < -ma =: f. 
no n 
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Note that 
0> yw!) = My(v) -f vy = —Me. 
x 
Now applying Proposition 3.5 we get 
1 1 
—Map(E) > tim = f vu! > tim = f vM@ > —Me. 
NSE nSIX 
Hence 
g 
u(E) < — 
a 


and letting « > 0 we conclude that «(B) = 0 for any a > 0 as required. 
The limit is almost everywhere T-invariant, by Proposition 3.3 and 
integrable by Fatou’s lemma 


1 1 
J imint—<o, < liminé f ——o < OO. 
n n 
In general, { |a(n,z)| < nf |a(1,2)|. 
4. Appendix: Garsia’s Proof of the Maximal 
Ergodic Lemma 


This appendix, which is not used above, notes that Garsia’s celebrated 
argument for Birkhoff’s ergodic theorem [G70] has a minor subadditive 
extension, although not strong enough to yield Kingman’s theorem. 


Lemma 4.1. Let a(n,x) be a subadditive ergodic cocycle as in the 
introduction. For 1 <n < o& define 


En = [> sup a(k, x) = of. 


1l<k<n 

Then 

i a(1,z)du > 0 and / a(1,x)du > 0. 

Ei, Exo 
Proof. Let 

hn(z) = sup a(k, 2), 
1<k<n 

so 


he (2) —a(k,x) >0 


for k < n. By positivity and linearity of T, viewed as (Koopman) 
operator on functions, 


Tht > Taz 
for k <n. (ag(x) = ak, 2).) 
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Hence 
a+ Thx > a+ Tag > agyi 
by subadditivity. So 
ay > ap41 — Thi 
for all k,1 < k < n and trivially for k = 0. Therefore 


a> sup aga, — Thi = hy — Thi. 
O0<k<n-1 


Now integrate 


[azf In — f Tht = | ny f Thi 
En En En x En 
> f nt-f Tht > 0, 
XxX Xx 


because T is measure preserving (or contractive as in [G70] would be 
enough) and h; > 0. 
The statement about E4. = J Ey follows from passing to the limit. 
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Abstract 

In this chapter we study random walks on a finitely generated group G 
which has a free action on a Z"-tree. We show that if G is nonabelian and 
acts minimally, freely and without inversions on a locally finite Z”-tree 
IT with the set of open ends Ends(I), then for every nondegenerate 
probability measure on G there exists a unique p-stationary 
probability measure v, on Ends(I), and the space (Ends(I), v,) is a 
u-boundary. Moreover, if jw has finite first moment with respect to the 
word metric on G (induced by a finite generating set), then the measure 


space (Ends(I’), v,) is isomorphic to the Poisson—Furstenberg boundary 
of (G, 1). 
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1. Introduction 


In this chapter we study random walks on Z”-free groups and the 
boundaries of such groups. This family of groups is a particularly nice and 
well-studied subclass of groups acting freely on A-trees (A-free groups), 
where A is an arbitrary ordered abelian group. 
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The theory of group actions on A-trees goes back to the work [23] 
of Lyndon who introduced abstract length functions on groups, axioma- 
tizing Nielsen cancellation method; he initiated the study of groups with 
real valued length functions. Later, in [6], Chiswell related such length 
functions with group actions on Z- and R-trees, providing a construction 
of the tree on which the group acts. At about the same time, Tits in [27] 
gave the first formal definition of an R-tree, which is a geodesic metric 
space with a tree-like structure. Eventually, in [25], Morgan and Shalen 
introduced A-trees for an arbitrary ordered abelian group A and the 
general form of Chiswell’s construction. A A-tree is a metric space whose 
metric takes values in A and is subject to certain tree axioms. The theory 
of group actions on such objects was consistently developed by Alperin 
and Bass (see [1]), where authors state the fundamental problem: find 
the group theoretic information carried by a A-tree action, in particular, 
the structure of A-free groups. Whereas the case of Archimedean free 
actions, that is, when A = R, is basically closed by the Rips’ Theorem 
that describes finitely generated R-free groups (see [8, 11]), the general 
non-Archimedean case is still open, though a lot of progress was made 
(see [2, 14, 20, 21]). 

The case when A = Z” with the right lexicographic order received a 
lot of attention due to the natural combinatorial structure of Z”-trees. 
This structure was exploited in [19] and [22] to obtain a description 
of finitely generated Z”-free groups in terms of free products with 
amalgamation and HNN extensions of a particular type (see also [2]). 
The class of Z”-free groups is a natural generalization of free groups 
which contains limit groups, R-free groups, etc. and which is closed 
under taking subgroups, free products, and amalgamated free products 
along maximal cyclic subgroups (n is not preserved in general). All these 
groups are hyperbolic relative to noncyclic maximal abelian subgroups 
(see [9, 14]) (hyperbolic if all maximal abelian subgroups are cyclic), 
coherent, with nice algorithmic properties. 

Let (X,d) be a A-tree. One can define an equivalence relation on 
maximal linear subtrees of X, which are called X-rays. The equivalence 
classes under this relation are called ends of X and the subset of open 
ends, that is, ends not ending at a point of X, is denoted by Ends(X). 
An open cut of X is a pair (Xo, X1) of open subtrees Xp and X1 of X 
such that X = Xo |_| X1. An open cut (Xo, X1) corresponds to a pair of 
open ends (e0, e1) respectively of Xo and X1 and we denote the set of 
all such pairs (eo, e1) arising from open cuts by Cuts(X) (see Subsection 
3.1 for details). 

Now that the sets Ends(X ) and Cuts(X) are defined, we can introduce 
the set 


X: = X UEnds(X) U Cuts(X). 
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In the case when A = Z” the set X can be equipped with a metric, so 
that X is compact in the topology induced by this metric (see (13.6), 
Subsection 4.1). 

The main result of the chapter is formulated below. 


Theorem 1.1. Let G be a countable nonabelian group acting freely 
and without inversions on a Z”-tree T, for some n € N. Assume also 
that the action of G is minimal, that is, TP has no proper G-invariant 
subtrees. Then, for every non-degenerate probability measure uw on G: 


(1) there is a unique w-stationary measure v, on the space Ends(I) 
of ends of T; this measure is continuous (i.e. it takes zero value 
on each point); the measure space (Ends(T), vz) is a -boundary 
of (G, 4m), 

(2) for almost every path t = {ti} of the w-random walk, all the 
sequences {t;-v}, uv € I, converge to a random end w(t) € Ends(T); 
the distribution of the limits w(t) is given by the p-stationary 
MEQSUTE Vy, 

(3) if G is finitely generated and y has finite first moment with respect 
to the word metric on G (induced by a finite generating set), then 
the measure space (Ends(I‘),vy,) %s isomorphic to the Poisson— 
Furstenberg boundary of (G, 4). 


In the proof of the above theorem we use methods of [10] and [5], and 
the structure of the proof is as follows. At first, in Section 4, we introduce 
a metric dp on I which makes T a compact metric space equipped 
with a continuous action of G (Theorem 4.8). Next, we prove that for 
every nondegenerate measure pz on G there exists a unique p-stationary 
measure v, on I and (I, v,) is a w-boundary of (G, 2) (see Proposition 
5.6), which implies that the space TP is u-proximal. Then, since the 
measure v, on I is concentrated on Ends(I’) (see Proposition 5.1), we 
deduce that (Ends(T°), v,) is a -boundary too, and hence p-proximal 
(see Corollary 5.7). Finally, we show that if w has finite first moment 
with respect to the word metric on G induced by a finite generating set, 
then (Ends(I), v,) is a maximal y-boundary (Proposition 5.11). 

Observe that there are other constructions of the Poisson boundaries 
of Z”-free groups. First of all, since every Z”-free group G is hyperbolic 
relative to its non-cyclic abelian subgroups (follows from results of [14] 
and [9]), one can study its Floyd boundary (see [12, 13]) which is 
nontrivial. Hence, from [17] it follows that the Floyd boundary of G 
is its Poisson boundary. Another approach is to use the fact that G is 
CAT(0) with isolated flats (see [4]), so one can construct its Poisson 
boundary using results of [18] (see also [26]). 

The advantage of the construction presented here is that it comes 
naturally from the action of a Z"-free group G on the underlying Z”-tree 
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without reference to general results about relatively hyperbolic groups 
and CAT (0)-groups, and provides a description of the boundary in terms 
of the underlying Z”-tree. 
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2. Preliminaries: Poisson—Furstenberg Boundaries 


2.1. Random Walks and Poisson—Furstenberg Boundaries 


Let G be a countable group and let uw be a probability measure on G. The 
measure pt is non-degenerate if its support generates G as a semigroup. 
The right-hand random walk on G with distribution jw (or, briefly, 
pe-random walk) is the time-homogeneous Markov chain whose state 
space is G, the transition probabilities are given by P(g, h) = u(g~'h), 
and the initial distribution is concentrated at the identity 1g of the 
group. Realizations of this process are called paths of the random walk. 
The path space of the random walk (G,) is the measure space 
(GNo, P.,) (No: = {0,1,...}), where the measure P,, is determined by 
the following condition. Let Cy,,....9,, p € No, denote the set of paths 


C190 055 9p = | {ri}ieno € GNo | (to, eee) Tp) == (90, ig op) 2 
Then 


Py (Co pdp) = 81.6 (G0) X M(Go GL) X +++ X MGp 1 9p): 


where 61, is the probability measure on G with 61,({1q}) = 1. The 
o-algebra of P,-measurable subsets is the P,,-completion of the 
o-algebra generated by the cylindrical sets C4g,...,9,. Since G is 
countable, the path space (Go, P,,) is a Lebesgue space. 

There are several ways to define the Poisson—Furstenberg boundary 
for random walks. In particular, the Poisson—Furstenberg boundary 
of the random walk (G,) can be defined as the quotient measure 
space of the path space (GN, P,,) with respect to the measurable hull 
(= measurable envelope) of the partition ¢ whose elements are the classes 
of the equivalence relation ~ (on the set GN0 of paths) defined as follows: 


(TiJieNy ~ (TpieNy <> FK,K: thay = ty yy VI > 0. (13.1) 


The measurable hull and hence the Poisson—Furstenberg boundary are 
objects defined modulo subsets of measure 0 (mod 0). 
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The partition ¢ is invariant under the action of G on GN defined by 
the rule g(to,71,---) = (gto, 9T1,.--). This action induces a canonical 
action of G on the Poisson—Furstenberg boundary. 


2.2. Furstenberg j-Boundaries 


Our exposition of the theory of Furstenberg j-boundaries is based 
on [10]. 

An action of a group G on a topological space M is a homomorphism 
from G to the group Homeo(M) of all homeomorphisms of M. An action 
is said to be (topologically) minimal if each orbit of the action is dense 
(or, equivalently, if the action has no proper closed invariant subsets). 
A space endowed with an action of the group G is called a G-space. 
A map f: X — Y of G-spaces is said to be equivariant if g(f(x£)) = 
f(g(z)) for alla e X,g€G. 

We denote the space of all probability measures on G by P(G) and 
the space of all Borel probability measures on M by P(M).! We endow 
P(M) with the weak* topology (by duality with the space Cy(M) of all 
bounded continuous functions on M). A subbase for the weak* topology 
is the collection of sets of the form 


{ye Pan v<(f sav) <a}, where f € Cy(M), a,deER. 
M 


Thus, a sequence {;}jeNy Of Borel probability measures converges with 
respect to the weak* topology (converges weakly) to a measure v if 
and only if for all f € Cy(M) the numerical sequence {fut dvi} Seng 
converges to fy, f dv. 

Any action of G on M induces an action of G on P(M): 


(g-0)(E) = 0(g"! - E). 


If c € M, then the Dirac measure 5; € P(M) is the measure defined by 
6¢(£) = 1 if  € E and 6;(£) = 0 otherwise (for a measurable EF). We 
say that a measure is continuous if it takes zero value on each point. 
Suppose that uw € P(G). A measure v € P(M) is said to be p-stationary 
if 
MeV = Yi g- vy) U9) =v. 
géEeG 


Assertion 2.1 ({10]). Let G be a countable group acting on a compact 
metric space M and let u € P(G) be an arbitrary measure. Then the set 
of u-stationary measures in P(M) is nonempty. 


lin [10], the space of compact-regular probability measures is considered. We are 
interested in Polish (separable complete metric) spaces or Borel subsets thereof. 
All Borel probability measures on these spaces are compact-regular. 
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Lemma 2.2 ({16]). Let G be a countable group acting on a space 
M, uw a non-degenerate probability measure on G and v € P(M) be a 
p-stationary measure. Let E C M be a measurable subset such that for 
every g € G we have either g-E = E, or (g:- E)N E = ©. Suppose 
further that there is an infinite family of pairwise-disjoint sets of the 
form g- E, where g € G. Then v(E) = 0. In particular, if the orbit G+ a 
of every point x € M is infinite, then v is continuous. 


Lemma 2.3. Let G be a countable group acting minimally on a 
topological space M, w be a nondegenerate measure on G, andv € P(M) 
be a w-stationary measure. Let E C M be a nonempty open subset. Then 
v(E) > 0. 


Proof. Observe that by definition, the u-stationary measure v is pers 
stationary for every k ¢ N. Consequently, for every k € N we have 


v(B) = > p*(g)(g- v)(B) = Yo w**(g)vg! - B). (13.2) 


géEG géEeG 


Suppose that v(#) = 0. Then (13.2) implies that v(g~!-E) = 0 whenever 
there exists k € N such that u**(g) > 0. Hence, vig! - FE) = 0 for each 
g € G because mw is nondegenerate. On the other hand, since G acts 
on M minimally, and F is open and nonempty, it follows that for each 
xz € M there exists g € G such that g- a € E. Hence, 


Lo? -£=|(Jgo- B=. 
gEG géEeG 


Since G is countable, it follows that 


1=v(M)=v LJot-# < )) vg"! BE) =0. 


géG geG 


This contradiction proves that v(E£) > 0. 


Theorem 2.4 ([10, 16]). Let G be a countable group acting on 
a compact metric space M. Take wEP(G) and let ve P(M) be a 
p-stationary measure. Then for P,,-almost every (a. e.) path t = {ti}ieNg 
of the y-random walk , the sequence {ti - V}ieN, converges to some 
measure X(t) € P(M). 


Definition 2.5. Suppose that M is a compact metric G-space, uw € 
P(G), and v € P(M) is a p-stationary measure. The pair (M, v) is called 
a Furstenberg u-boundary for G if for a.e. path t = {t;}ien, of the right 
p-random walk, the sequence of measures {t; - v}jcNy converges to some 
Dirac measure 5, (r), Where w(t) € M. 
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Definition 2.6. Let ju be a probability measure on G and let (X,v) 
be the Poisson—Furstenberg boundary of the pair (G, uw). The quotient 
measure space of (X,v) with respect to a G-invariant measurable 
partition will be called a x-boundary in the sense of Kaimanovich for G. 


Assertion 2.7. Let G be a countable group acting on a metric space M. 
Take a uw € P(G) and let v € P(M) be a w-stationary measure. Assume 
that for Py-a.e. path {ti}ieNy of the u-random walk the sequence of 
measures {Tj -V}icN, converges weakly to the Dirac measure bz for some 
zeM. Let 


bnd: GNo + M 
be the (P,,-4. e. defined) map that sends the path {Ti}ieNy to x whenever 


{ti -V}icNy converges to db. Then bnd is Py-measurable and bnd(P,,) = 
2 
v. 


Proof. Let us first show that for each bounded continuous f: M > R 
the composition 


F: =fobnd: GY +R 


is measurable (P,,-measurable) and 


i. F dP, = [i dv. (13.3) 


Observe that the sequence {F’;}xeN9 Of measurable functions 
F,: Go +R, Fy(t) = / f dt, (v) 
M 


is uniformly bounded (because f is bounded) and P,-a.e. converges 
pointwise to F (by the assumption of the assertion). Then by the 
Lebesgue bounded convergence theorem F' is measurable and® 


te F dPy = oe la Fy dPy = a Ng cs: ars) dP, (tT). 


Observe that the distribution of t, is given by p** and since v is 
p-stationary we have v = u** xy, whence 


he dF, i) Gy dX ((f. iy) 3 a) 


= faut +r) = | f dv. 
M M 


2 Since bnd is only Py-a.e. defined, it makes sense to consider measurability with 
respect to the P,-completion. 

3 Recall that, more generally, the pointwise limit of a sequence of measurable maps 
from a measurable space into a metric space is measurable. 
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We have thus proved that F' is measurable and 


i. BdP = ‘Tim: F;, ap, = f f dv. 
GNo k->oo JGNo M 


Recall that a map m: Q — X from a measure space Q to a metric 
space X is measurable (with respect to the Borel o-algebra on X) if and 
only if the compositions f om: Q — R are measurable for all bounded 
continuous f: X — R. (This readily follows from the fact that every 
metric space is perfectly normal.) It then follows from the above that 
bnd is measurable. 

Since bnd is measurable, the Borel measure bnd(P,,) on M is well- 
defined and for each bounded continuous f: M — R we have 


/ f dbnd(P,,) =f f obnd dP,, 

M GNo 

whence it follows by (13.3) that 
/ f dbnd(P,) ay f dv. (13.4) 
M M 

Recall that, in a metric space, distinct Borel probability measures 


determine distinct functionals on the space of bounded continuous 
functions. Therefore, (13.4) implies that bnd(P,,) = v as required. 


Assertion 2.7 means that every Furstenberg u-boundary is a quotient 
of the measure space (GNo, P,,) (as a measure space, disregarding the 
topology) with the quotient map bnd. Moreover, since bnd sends each 
class of the equivalence relation ~ (defined in (13.1)) to a point, it follows 
that a Furstenberg u-boundary is an equivariant quotient of the Poisson— 
Furstenberg boundary. This proves the following corollary. 


Corollary 2.8. Every Furstenberg -boundary is a u-boundary in the 
sense of Kaimanovich. 


Definition 2.9. (See [10].) Suppose that (I, d) is a metric G-space, 
bh € P(G). Let 


Hen) = (Wt we? tt )/n. 
M is said to be u-proximal if for all c, y € M and ¢ > 0 we have 
Mn gld(gz, gy) > &} ——> 0. 
Noo 
M is called mean-prozimal if it is w-proximal for all nondegenerate wu € 


P(G). 


Theorem 2.10. Let G be a countable group acting on a compact metric 
space M. Then the following conditions are equivalent: 
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(a) M is mean-proximal. 

(b) For any nondegenerate 4 € P(G) and w-stationary v € P(M), the 
pair (M,v) is a Furstenberg w-boundary for G. 

(c) For each nondegenerate we P(G) there exists a unique p- 
stationary vEP(M) and the pair (M,v) is a Furstenberg p- 
boundary for G. 


Proof. The equivalence (a) = (0) follows directly from [10, Theo- 
rem 14.1]. The implication (a) > (c) is proved in [24, Lemma3.1]. The 
implication (c) => (0) is trivial. 


2.3. The Strip Criterion of Kaimanovich 


We use the following corollary of the ‘Strip Criterion’, established by 
Kaimanovich in [15]. 


Corollary 2.11 ([15]). Let G be a finitely generated group with the 
induced word metric |-|. Let uw be a probability measure on G with finite 
first moment ¥~ |g|u(g). Let jt be the reflected measure defined by ju(g) = 
ug). Let (By,44) and (B_,d_) be w- and jt-boundaries (in the sense 
of Kaimanovich), respectively. If there exists a measurable G-equivariant 
map S assigning to pairs of points (b_, b+) € B_ x By nonempty ‘strips’ 
S(b_, b4) C G such that for (A~ x A+)-almost every (b_, b+) € B_ x By 
we have 


1 
— log card{g € S(b_, by) | |g| < i} —= 0, (13.5) 
a 17> 0 


then (B,,A+4) is isomorphic to the Poisson—Furstenberg boundary of the 
pair (G, 1). 


Remark 2.12. Note that, under the Strip Criterion (Corollary 2.11), 
the ‘strips’ S(b_, b,) are required to be 


(i) all nonempty, 
(i?) (AJ x A4)-almost surely ‘thin’. 


Clearly, since the strips are allowed to meet the ‘thinness’ requirement 
(A_ x A+)-almost surely (not surely), we can handle the ‘nonemptiness’ 
property in the same way. In other words, in order to use the Strip 
Criterion it suffices to construct a (measurable, equivariant) map 
S’: BL x B, > 2 with strips, which are 


(i?) (AL x A4)-almost surely nonempty, 
(ii?) (A- x A4)-almost surely thin. 


This is clear, because we can pass from S’ to a map S$ with the 
property (i) by setting S(b_, b4) = G if S’(b_, b,) = @ and S(b_, bi) = 
S’(b_, b4) otherwise. 
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Note also that, having a map with the properties (i’) and (ii’), we can 
replace all nonthin strips by empty ones and thus obtain a map that has 
the property (i) and the property 


(ii) all strips are ‘thin’. 


3. Preliminaries: Z"-Free Groups 


Following [1] we give some basic definitions from the theory of A-trees 
and then consider the case when A = Z”. All the details can be found 
in [1] and [7]. 


3.1. A-Trees 


A set A equipped with addition ‘+’ and a partial order ‘<’ is called a 
partially ordered abelian group if 


(1) (A,+) is an abelian group, 
(2) (A, <) is a partially ordered set, 
(3) for alla,B,y € A,a < Bimpliesa+y <Bt+y. 


An abelian group A is called orderable if there exists a linear order ‘<’ 
on A, satisfying the condition (3) above. In general, the ordering on A 
is not unique. 

For elements a, B € A, the closed segment [a, B| is defined by 


a,b) ={y eAla<y <p}. 


A subset CC A is called convex if for every a, B € C the set C contains 
[a, 6]. In particular, a subgroup C of A is convex if [0,6] C C for every 
positive BE C. 

Let A be an ordered abelian group. A A-metric space is a pair (X, d), 
where X is a nonempty set and d is a A-metric on X, that is, a function 
d: X x X > A satisfying the usual metric properties. A-metric spaces 
for an arbitrary ordered abelian group A were first introduced by Morgan 
and Shalen in [25]. 

If Ao is a convex subgroup of A, for any point zo € X the subset 


Xo = {y € X | dz, y) € Ao} 


of X is a Ag-metric space with respect to the metric dp = dx, called a 
Ao-metric subspace of X. 

If (X, d) and (X’, d’) are A-metric spaces, an isometry from (X, d) to 
(X’, d’) is a mapping f: X > X’ such that d(x, y) = d’(f(2), f(y)) for 
allz,yeX. 

A (geodesic) segment in a A-metric space is the image of the isometry 
a: [a,b], > X, where [a, b], is a segment in A between some a,b € A. 
The endpoints of the segment are a(a),a(b). 
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We call a A-metric space (X, d) geodesic if for all x,y € X, there isa 
segment in X with the endpoints z, y. We call (X, d) geodesically linear 
if for all x, y € X, there is a unique segment in X whose set of endpoints 
is {x, y}. We denote such a segment by [z, y]. 

A A-tree is a nonempty A-metric space (X, d) such that 


(T1) (X, d) is geodesically linear, 

(T2) if z,y,z € X then [z, y] N[z, z] = |x, w] for some w € X; this w 
is unique and we write w= Y(y,12, 2), 

(T3) if v,y,z € X and [z, y] O[y, z] = {y} then [z, y] U[y, z] = [z, 2]. 


A nonempty convex subset Xo of a A-tree X is called a subtree. Every 
subtree of a A-tree is obviously a A-tree itself (the axioms (T1)—(T3) 
hold for Xo since it is convex). Xo is closed if its intersection with any 
closed segment of X is either empty, or a closed segment, otherwise Xo 
is open. 

A A-tree is called linear if it is isometric to a subtree of A. 

Let (X, d) be a A-tree. A point e € X is an endpoint of X if, whenever 
e € [x,y] C X, we have either e = x, or e = y. A linear subtree from 
xz éX (or, originating from x) is any linear subtree L of X such that x 
is an endpoint of L. Observe that L carries a natural linear ordering (for 
y,2eL: ys z= d(a,y) < d(z,z)), with x as the least element. 
If there exists a maximal point y € L with respect to this ordering then 
L is just a closed segment [x,y] in X. If v € L, then Ly = {z € L | 
v < z} is a linear subtree from v and L = [z, v] U Ly, [x, vo] N Ly = {v}. 
A maximal linear subtree from x in X is called an X-ray from x. By 
[1, Proposition 2.22], the relation ‘LN L’ = Ly = Li, for some v’ is an 
equivalence relation on the set of X-rays. The equivalence classes under 
this relation are called ends of X. We distinguish closed ends and open 
ends of X: an end e is closed if all X-rays associated with e (ending at e) 
are closed segments |x, e], otherwise e is open. The set of open ends of 
X will be denoted by Ends(X). 

For any x € X and e € Ends(X) there exists a unique X-ray from 
x ending at e which we denote by |, e). Similarly, if e1, eg € Ends(X) 
are distinct ends of X, then there exists a unique linear subtree of X, 
which we denote by (e1, e2), connecting the open ends e; and eg (see 
[1, Proposition 2.24]). 

An open cut of X is a pair (Xo, X1) of open subtrees Xo and Xj 
of X such that X = Xo |_|] X1. By [1, Proposition 2.26], every open cut 
(Xo, X1) uniquely corresponds to a pair of open ends (e9, e1) respectively 
of Xo and Xj. Denote the set of all such pairs (eo, e1) arising from open 
cuts by Cuts(X). 

Suppose Y C X is a subtree of X. Then all open ends of Y are 
also open ends of X, from which it follows that Cuts(Y) C Cuts(X). 
At the same time, not all Y-rays are X-rays, so in general Ends(Y) ¢ 
Ends(X). 
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Next, if x,y,z € X, then [x,y] N [x,z] = [z,w] for some w € X 
by definition of A-tree. We denote the point w by Y(az,y,z). By 
[(1, Proposition 2.11], Y (a, y, z) does not depend on the order in the triple 
{x, y, z}, that is, for example, Y(y,x,z) = Y (az, y, z). It also follows that 
Y(2z,y,2) = [z,y] N[z,z] N [y, z]. We are going to call Y(z,y,z) the 
median of the triple {x, y, z}. 

We say that a group G acts on a A-tree X if any element g € G 
defines an isometry g: X > X. An action on X is nontrivial if there is 
no point in X fixed by all elements of G. Note that every group has a 
trivial action on any A-tree, when all group elements act as identity. An 
action of G on X is minimal if X does not contain a proper G-invariant 
subtree. 

Observe that an action of G on X induces actions respectively on 
Ends(X) and Cuts(X). 

Next, g € G is called elliptic if it has a fixed point. An isometry g € G 
is called an inversion if it does not have a fixed point, but g? does. If 
g is not elliptic and not an inversion, then it is called hyperbolic. For a 
hyperbolic element g € G define the characteristic set 


Avis(g) ={p € X |[g7!- p, np] ON [p, 9+ vp] = (Hh, 


which is called the axis of g. Azis(g) meets every (g)-invariant subtree 
of X. 

A group G acts freely and without inversions on a A-tree X if for all 
lg #49 € G, g acts as a hyperbolic isometry. In this case we also say 
that G is A-free. Observe that, if G is A-free and f,g are hyperbolic, 
then [f, g] =1@¢ implies Azis(f) = Axis(g), hence we denote Azis(g) by 
Axis(CG(9))- 


3.2. Groups Acting on Z”-Trees 


Now suppose A = Z” is considered with the right lexicographic order 
and let (X,d) be a Z"”-tree. Observe that Z” contains the minimal 
positive element (1,0,...,0) which we are going to denote by 1 (it will be 
clear from the context whether 1 denotes an element of Z”, or a natural 
number). 

Let k € [1,n]. We say that 2,y € X are Z*-equivalent («& ~zx y) 
if d(z,y) € Z*, that is, d(z,y) = (a,...,a,,0,...,0). From metric 
axioms it follows that ‘~z:’ is an equivalence relation and we call the 
corresponding equivalence classes maximal Z* -subtrees of X. Denote by 
&;,(X) the set of all maximal Z‘-subtrees of X. 

Observe that X/~zx is a Z"—*-tree with a metric d’ induced from d 
(see, for example, [20] for details). The set of vertices of X/ ~gx 
can be naturally identified with &,(X), so one can lift d’ to the metric 
dy : Sy(X) > Z"-* on &,(X) as follows: if m : X > X/ ~ze is 
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the canonical projection, then for S,7T € &,(X) define d,(S,7T): = 
d'(x(S),2(T)). 


Let G be a group acting freely and without inversions on X (in this 
case G is called Z"-free). It is not hard to discover the structure of G 
using Bass-Serre theory. By contracting elements of &,-1(X) to points 
we obtain from X a Z-tree Y equipped with the action of G inherited 
from X. From Bass-Serre Theory it follows that Vg = Y/G is a graph 
in which vertices and edges correspond to G-orbits of vertices and edges 
in Y and G is isomorphic to the fundamental group of the graph of 
groups associated with Wg. More precisely, if v ranges through the set 
of vertices of Wg, then G can be obtained from groups Gy (each Gy is 
a Z"—|-free group isomorphic to the G-stabilizer of some vertex of Y) 
by means of amalgamated free products and HNN extensions along free 
abelian groups Ge of rank not greater than n — 1. Here, every Ge is 
isomorphic to the G-stabilizer of some edge (an ordered pair of adjacent 
vertices) of Y. 

In the case when G is finitely generated, Wg is finite (see, for example, 
[22]). Now, since G and all Ge are finitely generated, and Wg is finite, 
it follows that all G, are also finitely generated (see, [8, Proposition 29, 
Proposition 35]). Using induction on n one can similarly show that the 
G-stabilizer of any maximal Z*-subtree of X, where k € [1,n—1], is also 
finitely generated. In particular, it follows that if G is finitely generated, 
then it is finitely presented. 

If G is finitely generated and H < G is also finitely generated, then 
by [20, Theorem 65] it follows that H is quasi-isometrically embedded 
into G. We are going to use this fact in the proof of Proposition 5.11. 

In our further investigations we are going to need the following 
technical definition (see [21] for details). 

For every vup,v1 € X such that d(vo,v1) = 1 we call the ordered 
pair (vo, v1) the edge from vp to v1. Here, if e = (u,v) then denote 
up = o(e), vy = t(e) which are respectively the origin and terminus 
of e. Denote by Ed(X) the set of edges of X. 

If Y C X is a subtree of X then we can define Ed(Y) as above by 
replacing X with Y since the metric on Y is induced from X. Obviously, 
Ed(Y) € Ed(X). 

If Y and Z are subtrees of X such that Y C Z then Ed(Y) C Ed(Z). 


4. From Z”"-Trees to Metric Spaces 


Let I be a countable Z”-tree with a designated point ¢. In this section we 
construct a compact metric space I associated with IP. Next, we study 
various properties of F and show, in particular, that any isometric action 
of a group G on I extends to a continuous action of G on I. 
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4.1. Compactification of a Z"-Tree 
Let (C, d) be a countable Z”-tree. Define 


P := TP UEnds(P) U Cuts(P), 


where Ends(I) and Cuts(I’) are the sets of open ends and cuts of T (see 
Section 3.1 for the definitions). 

Recall that if 2 ¢ T and e € Ends(I), then [z, e) is the unique I’-ray 
from x to e. Similarly, (e1, e2) is the unique linear subtree connecting 
distinct open ends ej, €2 of T. 

Now, let a,b € T’. If a = 5, in the case when a ¢ I, define |a, b| = 2, 
and when a € I, define La, b| = {a} = {b}. Otherwise, if a 4 b then we 
define [a,b] to be a linear subtree of I’ as follows. 


e If a,b €T then La, b| = [a, db]. 

e If a,b € Ends(L) then |a, 6] = (a, b). 

e IfaeTl, b € Ends(L) then La, b| = [a, bd). 

e Let a € I and b = (e@,e1) € Cuts(T), where e; € Ends(T,), 

[T =loL|Vi and a € Tp. Then define La, b| = [a, eo). 

Let a € Ends(T) and 6b = (e, e1) € Cuts(T), where e; € Ends(T,), 

Tl =VloL]l1. Then a is an end of one of the subtrees To, T1, say 

a € Ends(19), so, we define La, b| = (a, eo). 

e Let a = (e€9,e1), where e; € Ends(Ty),l = ToL]li, and b = 
(fo, fi) € Cuts(T). It follows that 6 is an open cut of one of the 
subtrees 9, ['1, say b € Cuts([9). Now, since eg € Ends(I9) and 
b € Cuts([9), replacing with Mo in the definition case above we 
can assume that [eo, b| is defined for Tg and set La, b| = Leo, db]. 


It is easy to see from the definition above that La, b| = |b, a] for any 
a,b €T. Also, if [a,b] 4 @ then a and 6 can be considered as ends of 
La, b| (closed if points in I, open otherwise). 


Lemma 4.1. JfaeéTI, then the map py: bt [a,b] establishes a one- 
to-one correspondence between I and the set of all linear subtrees from 
amr. 


Proof. From the definition above it follows that La, b,| 4 |a, be] if 
by 4 bg. Hence, w is injective. In order to show that it is also surjective, 
for each linear subtree L of T from a we show that there exists b € T 
such that L = [a,b]. 

If L is closed, then L = [a, b] for some b ET. That is, L = a, db]. 

Now, let L be open. If Z is maximal, then it is a T-ray and L = [a, 6) 
for some 6 € Ends(I) and we have L = [a, db]. 

If L is not maximal, then there exists a subtree I’ of F such that L is 
aI’-ray from a (for example, L itself is such a subtree). The union of all 
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such subtrees I’ is a subtree [9 of I since every I’ contains L. In fact, 
from the construction of [9 it follows that it is the maximal subtree (by 
inclusion) of [ such that L is a Tg-ray from a. 

Observe that I['9 is open. Indeed, since LZ is not maximal in T, it can 
be extended to a I-ray L’ from a and there exists x € L' \ L. Hence, 
x €Vo (because L is maximal in T9) and [9 N[a, 2] = L which is not a 
closed segment. That is, [9 cannot be closed, hence, it is open. 

Next, consider Ty = [\ Tg. We are going to show that T; is a subtree 
of I. Observe that Ly = L/ \ L is a linear tree contained in Ty and x 
defined above belongs to L,. If y € Ty is such that [x,y] NTo 4 2, 
then y ¢ [1 and I’ = [y, a] UL is a subtree of I such that L is a T’’- 
ray from a. But then we get a contradiction with the construction of 
Io since I’ ¢ To. The contradiction comes from the assumption that 
[x,y] Ao € @ and it follows that [z, y] ¢ [1 for every y € Ty. In other 
words, I’; is convex, hence, it is a subtree of I. 

Moreover, Ty is open. Assume on the contrary that Ty is closed and 
consider £1 C Ty. Since Ty is closed, LZ is also closed and there exists 
z € Ly such that [a,z]Q Ly = [z, 2]. Now, since the metric on I takes 
values in Z”, there exists a point 2 € L’ such that d(a, z) = d(a, a) +1, 
where 1 is the minimal positive element of Z”. Obviously, 2 ¢ L, because 
otherwise [a,z] MN L1 = [2,2] — a contradiction. Hence, z € L and 
L = |[a, 2] — a contradiction with the fact that DL is open. 

Thus, I; is open and it follows that 6 = (9,T'1) is an open cut such 
that L = |a, 6]. 

From the argument above it follows that for any linear subtree LZ from 
a there exists b € F such that L = [a, b| and the statement of the lemma 
follows. 


Below we are going to use the following convention: for an edge e € 
Ed(r) and a,b € T we write e € |a, b| meaning that e € Ed|a, b|. Also, 
if a = b, then obviously Eda, b| = @ since [a, b] in this case is either 
empty, or consist of a single point of I. 


Lemma 4.2. Jf a,b,c €T then the following hold. 


(a) If |a,b] #4 @ and x € [a,b], then |a,b| = a,x] U |z,b] and 
La,z}N La, bj = {zr}. 

(b) Ifa eT and |a,x] N La, b| = {x}, then La, x] U La, b] = La, dJ. 
(c) If a,b,c are pairwise distinct, then La,c] N Lc,b] #4 @ unless 
c = (e9,e1) € Cuts(T), where e; € Ends(Ty), 7 = ToL]Ti, és 
such that La,c|] CTo and [c, bJ CT. 
) La,c] ULe, b] is a subtree of T. 
(e) La,b] © Lae] U Le, BI. 
(f) Edla, 6] C Edla, c}] UEdLe, bJ. 

) Eda, b] NEdla,c|] NEdLb, c] = 2. 
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(h) Ifc eT and |a,b] 4 2, then 
Lc, a] AN Le, b] € Le, z] 
for any z € [a,b]. 


Proof. (a) If a,b €T, then by definition of ‘|, |’ we have |a, b] = [a, 6]. 
Since z € [a,b], we have [a,z] N [z, 6] = {x} and [a, b] = [a,x] U [z, b] 
by definition of A-tree. At the same time, La, x] = [a,2], Lv, b] = [z, 5], 
and the statement follows. 

Ifa € T and 6 € Ends(I), then by definition of “| , |”, we have La, b| = 
[a, 6) which is a linear subtree from a. Since x € [a, b), it follows that 
|x, b) is a linear subtree from x such that we have [a, 6) = [a,x] U [2, b) 
and [a,x] [z, 6) = {x} (see [1, Definitions 2.21]). Now the statement 
follows since [a, z] = |a,x] and [z, b) = a, bJ. 

Ifa € Tl and b = (eo, e1) € Cuts(I’), where e; € Ends(y), l =ToLJT1 
and a € T9, then by definition of “|, |”, we have La, b] = [a, eg). This 
is a linear subtree from a in Ig, so the required statement follows from 
the argument given in the previous paragraph with I replaced by I. 

Suppose a, b € Ends(I). Hence, by definition of ‘| , |’, we have |a, b| = 
(a, b). Since x € (a,b), it follows that (a, 2] [z,b) = {x}. Now from 
[(1, Proposition 2.24], it follows that (a,z]U [z,b) = (a,b). Hence, the 
statement holds since (a,z] = |a,xv] and [z,b) = |z,b] by definition 
of ‘|, |’. 

Suppose a € Ends(T) and 6 = (e9,e1) € Cuts(I), where e; € 
Ends(f;), 0 = ToL] T1. Then, assuming that @ € Ends(Ip), by definition 
of ‘L, |’, we have [a,b] = (a, eg). Since e9, a € Ends(T9), the statement 
follows from the argument above with TI replaced by To. 

Finally, suppose that a = (eo, e1), where e; € Ends(T;), F = FoLJV, 
and b = (fo, fi) € Cuts(P). Assuming that 6 € Cuts([9), by definition 
of ‘L, |’, we have La,b|] = [eo,b] and the statement follows from the 
argument above with I replaced by Fo. 

(b) If a,b € T then the statement holds by definition of A-tree. 

If a € F and 6 € Ends(L), then La,z] = [a,z], |x, b] = [x,b). In 
particular, [z, b) is a linear subtree from x and [a, z|]N[z, b) = {2}. Hence, 
[a, z]U[z, b) is a linear subtree from a which ends at 6 (see [1, Definitions 
2.21]). That is, [a,z] U[a, 6) = [a, 6) = La, b] and the statement follows. 

Ifa € T and b = (e9, e1) € Cuts(), where e; € Ends(;), 7 = To LJ Ti 
and a € [9, then by definition we have |.a, b| = [a, eo). It follows that x € 
To because otherwise we have a contradiction with [a, 2] N[z,b) = {2}. 
Now, the statement follows from the argument above with I’ replaced 
by To. 

Suppose a,b € Ends(I). By [1, Proposition 2.24], for any z ¢ T we 
have [z, a) U[z, 6) = (a, b) whenever [z, a) N[z, 6) = {z}. This implies the 
statement because we have |x, a] = |x, a), [a,b] = [x, b), and La, b] = 
(a, b) by definition of ‘|, |’ 
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Suppose a € Ends(f) and 6 = (eg,e1) € Cuts(T), where ej; € 
Ends((;), = ToL]. Assume that a € Ends(Io). Hence, x € Vo 
because otherwise we have a contradiction with [x, a) N[z, 6) = {a}. Now, 
the statement follows from the argument above with I’ replaced by Io. 

The case when a,b € Cuts(I) can be similarly reduced to one of the 
previous cases. 

(c) If c € I, then both La, c] and |c, | are linear subtrees of P from c. 
Hence, c € a,c] M|[c, b| and the statement follows. 

Suppose c € Ends(I) and let x € a,c], y € Le, b] be arbitrary (such 
x and y exist since both [a,c] and |c, b] are nonempty by assumption). 
Hence, [x, c) and [y, c) represent the same end of I and by definition of 
ends it follows that [z,c) M[y,c) is nonempty. Hence, La, c] N Lc, bJ is 
also nonempty. 

Now, let c = (eg, e1) € Cuts([), where e; € Ends(T;), 7 = PoL]T1. 
Suppose that both La, c| and [c, b| belong to one of the subtrees 9, T1, 
say T'9. Hence, repeating the argument above with I replaced by "9 we 
obtain that La,c] Nc, b|] #2. 

(d) If at least two of the points a,b,c coincide, then (d) trivially 
holds. So, suppose that a, b, c are pairwise distinct. 

From (c) it follows that La,c] U Lc, 6] is connected (hence, it is a 
subtree of I) unless c = (eg, e1) € Cuts(I’), where e; € Ends(T,),P = 
ToL]Vi, is such that |a,c] C Mo and [c,b] C Ty. Assume the latter 
and take arbitrary points x € La, c], y € Lc, bJ. Then, by [1, Proposition 
2.26], we have [z, y] = [x, e9) U[y, e1). Next, by definition of ‘|, |’, we 
have Lx, c] = [z, eo), Le, y] = (e1, y], hence, 


[x,y] = Lz, c] ULe,y]. 
Applying (a) we have 
La,c]ULe, bJ = (La, zJU La, c/)U(Le, yJULy, bJ = La, x] U[z, y]ULy, 8] 


which is connected. Hence, La, c] Uc, b] is a subtree of T. 

(e) Again, if at least two of the points a, b, c coincide, then (e) trivially 
holds. So, suppose that a, b, c are pairwise distinct. 

Suppose at first that one of the subtrees a,c], Lc, b| is contained in 
the other one, say |a,c| C Lc, b]. If a¢T then a € [c, b|. Hence, from 
(a) we have 


Le, 6] = Le, a] U La, 6] 


and the statement follows. Suppose a ¢ I’. Then it follows that a € 
Cuts(I). Indeed, a cannot be an end of I’ since for every z € [c, a], 
the linear subtree |z,a] is not a maximal linear subtree from z. Let 
a = (e9,€1), where e; € Ends(;), 7 = To LJ Pi. Assume, without loss 
of generality, that c belongs to [9 and 6 belongs to I; (as points, ends, 
or open cuts). From (c) it follows that there exists x € Lc,b] NM La, 6]. 
Indeed, since a € Cuts[c, b], the ends ep and e, uniquely correspond to 
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each other and they cannot be separated by b because b # a. By (a) 
we have 


le,b] =le,e} Ula, 6], Lab] = La,r] Ula, b] 


and it is enough to show that La,z] C |Lc,z]. But La,z] = (a,2| by 
definition of ‘|, |’ and for every y € Lc, x] MT, by [1, Proposition 2.26], 
we have [y, z] = [y, eo) U (e1, 2]. In other words, 


La, xz] = (e1,2] C [y, 2] € Le, x]. 


Now suppose that La,c] ¢ Lc, 6J, Le, b] Z La,cJ. From (c) we have 
the following two cases. 

Case I. [a,c] N|c,b] 42. 

Let x € [a,c] \ [c,bJ and y € Lc, 6b] \ La, c]. Observe that |x, c| and 
Ly, c| are linear subtrees of La, c|ULc, b] (which is a subtree of T by (d)). 
Hence, by [1, Proposition 2.22], there exists v € |a,c] ULc, 6] such that 
either La,c] ON Lc, b] = [c, v] in the case when c ET, or La, c] N|c, bJ = 
(c, v] in the case when c ¢ I. In both cases, by definition of ‘| , |’, we 
have 


La,c] Me, 6] = Le, vJ 


and |a,v|] Lb, uv] = {v}. Now, by (b) we have La, uv] U Lb, v] = La, b} 
and 


La, b] = a,v] ULb,v] C La, ce} U Le, 6], 


where the last inclusion follows from (a). 

Case II. c= (ep, e1) € Cuts(I), where e; € Ends(y), 7 = To LJ Pi, 
is such that |a,c] CT and [c, bJ CT}. 

By (c) there exist « € La,c] Ma, 6] and y € [c,b] Na, 6}. Hence, by 
(a) we have 


La, 6] = La,r]U la, yJU Ly, oJ, 


la,c] =[a,r]Ula,c], Le,b]=Le,yJULy, 6]. 


Hence, in order to prove [a,b] © La,c] Ulc, 6] it is enough to show 
that |z,y] C Lz,c] ULce, yJ. But we have |z, y| = [z, y], Lv, c] =[z, c), 
Lc, y] = (c, y], so the required follows from [1, Proposition 2.26]. 

(f) Suppose on the contrary that there exists an edge (vo, v1) in the 
set Ed|a, b| \ (Edla, c] U Edlc, b]). This implies that one of the points 
ug and vz lies in Lc, a] \ Lc, bJ], while the other one is in |c, b] \ [c, a]. 
Then vo is not in |v, c], while vz is not in |vo, c]. 

If [v1,c] N Lvo,c] is empty, then (c) implies that vg and vw are in 
distinct Z-subtrees of I so that (vp, v1) is not an edge — a contradiction. 

If Lv, c] Nv, c] is not empty, then [1, Proposition 2.28] implies that 


Lvo, 1} Lu, cl] A lu,c] 2S. 
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Suppose v € Luo, v1] Avo, c] N Lu, c] 4 B. Then v ¢ {u9, v1} because up 
is not in |v, c], while v is not in Luo, c]. Therefore, [vo, v1] = [vo, v, v1], 
which means that (vg, v1) is not an edge — a contradiction. 

(g) If at least one of the intersections [a,b] NM La,cJ, La,b) nN 
Lb, cl, La,c] A |b, c| is empty, then La, b| NLa,c] Nb, c] is also empty 
and the statement follows. 

Suppose that all three intersections [a,b] M la,cl, [a,b] A 
Lb,c], La,c] Nb, c] are nonempty. By [1, Proposition 2.28] it follows 
that 

La, b] NLa,c]N Lb, cl] #2@. 
Suppose v € La, b] NLa,c] N Lb, c]. Hence, from (a) we have 
La,b} =la,vJULv, oJ, La,c]=La,vJULv,c], Lb,c] = Lb,vJ Uv, c] 
and 


La, 6b] N La, c] = (La, vj U Lv, J) A (La, vJ U Lv, c}) 
= La, v] U(La,v] Nv, cJ) U (Lb, vo] N Lv, al) U (Lb, vJ N Lv, c}) 
= La, vj] U{v} U{v} U {v} = La, uv]. 
Hence, apply (a) again and obtain 
La, b] NLa,c]N Lb, cl = La, v] N Lb, cl] = La, v] N (Lb, vu] UL, cl) 
= (La,v} 116, v]) U (La, vO Lv, e}) = {v}. 


Since the intersection La, b| N La,c|N [b,c] is just a single point, the 
required follows. 

(h) Suppose on the contrary that there exists « € Lc,a]M[Lc, bJ such 
that x ¢ Lc, z]. By (e), we have Lc, 6] € Lc, z] U Lz, bJ. Since x € |e, b} 
and x ¢ |c,z], it follows that z € |z,b]. At the same time, (e) implies 
that we have |c,a] C |c,z] U|z,a]. Since x € [c,a] and z ¢ [c,z], 
it follows that x € [z,a]. Thus, it is shown that x € |z,b] N |z, a]. 
However, |z, b] N Lz, a] = {z} by (a). Consequently, we have x € {z} — 
a contradiction. 


Next, recall that in Section 3.1 we introduced the notion of the median 
Y (a, y, z) for any triple of points {z, y, z} C T. We extend this notion to 
lr: for a,b,c eT 


e set Y(a,b,c) =aifa=bora=c, and set Y(a,b,c)= bifb=c, 

e if one of the points a, b,c of I, say c, is an open cut c = (eg, €1) € 
Cuts(), where e; € Ends(f;),l = ToL]T1, such that La,c} C To 
and |c,b| CT , then set Y(a, b,c) =, 

e set Y(a, b,c) = La, b] NLa,c] Nb, c| otherwise. 


Observe that Y(a, b, c) is correctly defined for any triple a,b,c € I. 
Indeed, suppose a,b, and c are all pairwise distinct and one of the 
intersections 
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La,6]O a,c], La,bJNLb,cl, Lac] NL, cl], 


say La,c] Nb, c], is empty. In this case, by the assertion (c) of Lemma 
4.2 we have the case covered in the second part of the definition above. 
Now, if all three intersections |a,b]N La,c], La,b] N Lb, cl, La,c] 
Lb, c] are nonempty, by the proof of assertion (g) of Lemma 4.2, La, b| 
La, c] Nb, c| is a single point v € T and by the definition above we have 
Y(a,b,c) =v. 

Now we define a metric on I. Consider the set Ed(I) which is 
countable by assumption. Let f: Ed() ~ R+ be a summable positive 
real-valued function, that is 


a f(e) < +00. 


ecEd(L) 
We define the function 
dp: xT >R (13.6) 


by setting 
d(a,b)= > fee). 


ecEd|a,b] 


Then dy is clearly a metric on I (the triangle inequality for dy follows 
from the assertion (f) of Lemma 4.2). 


Lemma 4.3. Let {aj} and {b;} be two sequences of points in the metric 
space (I’, dp). Then 


(1) the sequence {a;} is fundamental (i.e. Cauchy sequence) if and 
only if for every edge e € Ed(I) there exists N > 0 such that 
e ¢ Edla;,a,| whenever 7 > N andk > N, 

(2) the sequence {dp(ai, bi)} tends to zero if and only if for each e € 
Ed(L), the set {1 € N | e € Edlaj, bj]} ts at most finite, 

(3) the sequence {a;} converges to a point t € T if and only if for each 
e € Ed(L) the set {i E N | e € Edlaj;, t]} ts at most finite. 


Proof. (1) If {a;} is fundamental and e € Ed(I), then there exists N > 0 
such that dy(aj, ax) < f(e) whenever j > N and k > N. It follows that 
e ¢ Edla;, az] whenever j > N andk > N. 

Conversely, assume that for every e € Ed(I) there exists N(e) > 0 
such that e ¢ Ed|aj, az] whenever 7 > N(e) and k > N(e). Observe 
that, since f is summable, for any 5 > 0 there exists a finite set E'¢(5) C 
Ed(I) of edges such that 

> f(e) <6. 


e€Ed(P)x Ej (6) 


Let N = max{N(e) | e € Ey¢(6)}. Then dp(a;, ax) < 5 whenever j > N 
and k > N. This means that {a;} is fundamental. 
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(2) If {dy (ai, b;)} tends to zero and e € Ed(I), then there exists N > 0 
such that df(aj,b;) < f(e) whenever 7 > N. It follows that the set 
{7 EN | e € Edla;, b;|} consists of at most N elements. 

Conversely, assume that for every e € Ed(I) there exists N(e) > 0 
such that e ¢ Ed|a;,b;] whenever 7 > N(e). Let Ey(5) be the finite 
set described in the proof of (1) and let N = max{N(e) | e € Ef(6)}. 
Then d(a;, bj) < 6 whenever j > N. This means that dy(a;, b;) > 0 as 
i > oo, as required. 


(3) follows from the previous assertion by setting b; = t for every i. 


Proposition 4.4. The topology of the metric space (I, df) is indepen- 
dent of the choice of the summable function f: Ed(T) > Rx. 


Proof. It is well known (and easy to check) that the topology of a 
metrizable space is determined by its convergent sequences. By Lemma 
4.3(3), the set of convergent sequences in (I, ds) is independent of the 
choice of f. 


For an edge e € Ed(I’) denote by Re the binary relation on the set of 
points of IP defined by 


(abe Re @& ec € Eda, 0d]. 


It is easy to see that Re is an equivalence relation (transitivity follows 
from the assertion (f) of Lemma 4.2), which has precisely two equivalence 
classes (follows from the assertion (g) of Lemma 4.2). 


Proposition 4.5. I is compact in the topology induced by the 
metric dp. 


Proof. It is enough to show that every infinite sequence {x;} of points 
in (I, df) has a convergent subsequence. 

Let us first show that {x;} has a fundamental subsequence. Since Ed(T) 
is countable, {2;} has a subsequence {xi,} such that for each e € Ed(T) 
there exists N(e) > 0 such that (Xi;, 2%) € Re (which is equivalent 
toe ¢ Ed|2j,, £%, |) whenever 7 > N(e) and k > N(e). Then, {2;,} is 
fundamental by the assertion (1) of Lemma 4.3. 

Now, we show that every fundamental sequence {yj} in (I, df) 
converges. Fix an arbitrary point y € P and consider the linear subtree 


b= (lal. 


keNj>k 


There exists a unique point t € TP such that L = Ly, t] (see Lemma 4.1). 
We show that {y;} converges to t. Assume that it does not. Then, by 
the assertion (3) of Lemma 4.3 there exists an edge e and an infinite 
subsequence {yj} in {yi} such that e € |t,y,] for all j. Since {yj} is 
fundamental, it then follows by the assertion (1) of Lemma 4.3 (and by 
the assertion (f) of Lemma 4.2) that e € [t, y;] for all 7 sufficiently large. 
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In the case where e ¢ |y,t], we have e € ly, y;]| for all sufficiently 
large i (we use the assertion (f) of Lemma 4.2), whence e € Ly,t] (by 
definition of L and t), a contradiction. 

In the case when e € |y, t] we have e ¢ | y, y;| for all sufficiently large 7 
(we use the assertion (g) of Lemma 4.2 and the fact that e € |t, y:J) 
whence e ¢ |y,¢t] (by definition of Z and t), a contradiction. 

This proves that {y;} converges to ¢t and the proposition is thus 
proved. 


Remark 4.6. It is obvious from the construction that P is dense in 
(I’, dy). Therefore, Proposition 4.5 implies that I’ is the compactification 
of T’ with respect to df. 


4.2. Group Action on I 


Let I be a countable Z”-tree and I’ be the compactification of I with 
respect to the metric dp constructed in Subsection 4.1. 

Observe that an automorphism a: T — IP sends I’-rays to ’-rays, ends 
to ends, open cuts to open cuts, and therefore induces an automorphism 
a:ror. 


Proposition 4.7. Every automorphism of I is a homeomorphism with 
respect to the metric dy. 


Proof. Ifa: T > I is an automorphism, let fy: Ed(T) > Ry be the 
function defined by 


fale) := f(a*(e)). 


Then @ is an isometry between (I, dr) and (I, dy,). Therefore, a is a 
homeomorphism because df and dg, induce one and the same topology 
by Proposition 4.4. 


Theorem 4.8. Jf a group G acts onT by isometries, then the induced 
action of G onT is continuous with respect to the topology generated by 
the metric dg. Moreover, if G is nonabelian and acts on T freely, without 
inversions, and so that T has no proper G-invariant subtrees, then the 
orbit of every point in T is infinite. 


Proof. The first part of the statement follows from Proposition 4.7. 

Now suppose there exists v € F such that {G - v} is finite. It follows 
that |G: Stabg(v)| < oo. 

If v € T, then T must be spanned by {G - v} because otherwise 
contains a proper G-invariant subtree. But then, since the action is 
isometric, it follows that either there is a fixed point, or an inversion—a 
contradiction in either case. Thus we have v € Ends(T) U Cuts(T). 

Suppose v € Ends(I’). By [7, Lemma 3.1.9], if v is fixed by g € G, then 
v isan end of Azis(g), which is a maximal linear g-invariant subtree of I. 
If v is fixed by another element h € G, then v is an end of Azis(h) too, so 
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Axzis(g) = Azis(h), from which it follows that [g, h] = 1. Thus, Stabg(v) 
is abelian and G is virtually abelian. But G is Z”-free, so commutation 
is transitive and it follows that G is abelian — a contradiction with our 
assumption. 

The case when v € Cuts(I) is considered similarly. 


Theorem 4.9. Assume that a group G acts on TI by isometries and 
I has no proper G-invariant subtrees. Then the induced action of G on 
the set of ends Ends(I’) either has a global fixed point, or is topologically 
minimal (with respect to the topology induced by the metric dy ). 


Proof. Assume on the contrary that the action of G on Ends(I) neither 
is topologically minimal, nor has a global fixed point. Then Ends(T) has 
a proper closed (in Ends(T)) G-invariant subset K containing at least 
two points. Set 


Pee -||.) (ley blk 


a,be Kk 


We are going to obtain a contradiction by showing that x is a proper 
G-invariant subtree of I. 

First, observe that [x #4 @. Indeed, since K contains at least two 
points a # b, we have "x D [a,b] AG. 

Next, observe that [x is a subtree in I’. Indeed, if z,y € x, then 
xz € |a,b| and y € [c,d] for some a, b, c,d € K. Interchanging a with b 
if necessary, we can assume that b € c. Then 


La, bJNLb,cl FOSAlb,c]O Le, d], 


so that the union 
N= |[a,b] ULb,c] ULe, d] 


is a subtree by [1, Lemma 2.13]. Therefore, we have [x,y] C N CI'x. It 
is thus shown that [x,y] C [x whenever z,y € x, which means that 
Ix is convex and hence (being nonempty) is a subtree of I’. 

Since K is G-invariant, so is [x. It remains to show that [x #T. 

Since K is a proper closed subset in Ends((), we can choose p € 
Ends(T) and ¢ > 0 such that dist a, (p,K) > e. We show that there 
exists an edge e in I such that [k, p| contains e for each k € K. Since f 
is summable, there exists a finite set E(e) C Ed(I’) of edges such that 


ie f(e) <6. 
ecEKd(D)~ Ef (€) 


We denote by M the set of all the vertices of the edges in Ey (¢). Observe 
that the intersection 
L= { \[v,p) 


veM 
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of T-rays is a '-ray because M is finite. Let e be an edge in L. Ifk € K, 
then |k,p] contains an edge of Ey(e) because dist dy (p,K) > «€ by 
construction. Therefore, |k,p| contains a point v € M, whence we have 


e € Ed(L) C Ed|v, p) C Edk, p]. 


Now, we can show that |z,p]| contains e for each x € Ix. Indeed, if 
xz € Tx, then by the definition of [x there exist a and b in K such that 
x € [a,b]. We see that [a,b] does not contain e by the assertion (g) of 
Lemma 4.2 because |a,p| and [b,p| both contain e as proved above. 
Consequently, |x, a] does not contain e also, being a subset in La, b]. 
Therefore, |z, a] does not contain e while |a,p| contains e. Then |[z, p| 
contains e because we have La,p| C |x,a] Ux, p] by the assertion (e) 
of Lemma 4.2. 

Thus, we have shown that dist a, ((x,p) => f(e) > 0. Consequently, 
lx €T. This completes the proof. 


4.3. Convergence in fF 


In the space I under investigation, there is a strong relationship between 
the convergence of points and the weak convergence of measures. For 
example, if [ is an ordinary Z!-tree (n = 1), up a vertex in T, z is an 
end in Ends(I), v is a continuous Borel probability measure on I’, and 
{a;} is a sequence of automorphisms of I, then the following conditions 
are equivalent: 


(i) the sequence {a; - up} converges to z, 
(ii) the sequence {a; - v} weakly converges to the Dirac measure 6,. 


If n > 1, then (i) and (ii) are no longer equivalent. However, in the 
general case, the relationship mentioned above shows up when we impose 
appropriate restrictions on z and/or {a;}. In fact, our proof of the main 
theorem is based on this relationship (see Corollary 5.4 and Proposition 
5.8). In this subsection, we do some preliminary work that will allow us 
to prove Corollary 5.4 and Proposition 5.8. 

Recall from Subsection 3.2 that Gp_1(P) is the set of all maximal 
Z”"-1-subtrees of I. 


Definition 4.10. Define the function h,: Tx PF > [0,+00] as follows. 
Set f(a, a) = 0 for each a € T and 


hn (a,b) = card{S € Gp_1(T) | SO La, 6] SB}, 
for distinct a,b eT. 


That is, f(a, 6) is the number of distinct maximal Z”~!-subtrees of 
I intersecting with |a, 6]. Observe that if a and 6 are distinct ends of 
I, then fi,(a, b) may be equal to +00. 
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Lemma 4.11. The function 
fin: T x T > [0, +00] 


is an extended metric on I’, that is, hn satisfies the standard axioms 
of metric, but can attain the value +oo. This extended metric is 
G-invariant. The restriction of this extended metric to T is a metric. 


Proof. Assertion (e) of Lemma 4.2 implies that fy satisfies the triangle 
inequality. The other required properties of i, immediately follow from 
its definition. 


Recall that € is a designated point in I’. By abuse of notation, for 
pel, we set in(p) = hyle,p), and for g € G, we set 


An(g) = hn(g- &) = hnlé,g- €). 


Lemma 4.12. Let {g;} be a sequence in G, let ug be a point inT, and 
let z be a point in. Assume that the sequence {g; - vo} converges to z 
and that the sequence {hn (g;)} tends to infinity as i > oo. Then all the 
sequences {gi - vu} with v ET converge to z. 


Proof. Since the action of G on I is isometric with respect to the 
extended metric fin, the assumption that {fn(gi)} = {hn(gi- €,€)} tends 
to infinity implies that {fin (gi-v)} tends to infinity for each v € [. Indeed, 
we observe that fin(gi-&, gi: v) = hn(e,v) = hn(v) is finite, while the 
triangle inequality (see Lemma 4.11) implies that 


hin(gi-¥) = hn(gi-v,€) = hn(gi-€,€) — hin (gi &, 9°) = hin (gi) — hn (v). 


In particular, {h»(g; - vo)} tends to infinity. 

Now, we take v € I, put & := fin(uo,v), and observe that for each 
i € N the geodesic segment [gi - vo, gi - v| is contained in the fhy-ball of 
hn-radius k centered at g;- up. Since {fin(g; - vo)} tends to infinity, this 
implies that the h,-distance between ¢ and [g;- v9, g;-v| tends to infinity. 
Therefore, for each e € Ed(I) the set {2 € N | e € Ed[g;- vo, 9; - v|} is at 
most finite. The result then follows by the assertion (2) of Lemma 4.3. 


Lemma 4.13. Let {a;} and {b;} be two sequences of points in T. 
Suppose that hn(Y(e, aj, b;)) > co as i > oo. Then dp(aj, bj) > 0 
ast —> oo. In particular, {aj} converges to a point w € T if and only if 
{bi} converges to w. 


Proof. Observe that if c ¢ T and y € T belongs to le, zJ (as a point 
or an open cut), then in(y) < hn(z) for every z € Ly, x] (follows from 
definition of hn). 

Next, from the assertion (h) of Lemma 4.2 it follows that for any 
a,b €T such that [a,b] 4 @ we have 


le,a] Le, b) C Le, z] 
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for any z € La, b|. Hence, it follows that 


min hy»(z)=hpl(Y (e, a, b)). 
zela,b| 

Therefore, we have minze|a,b|fn(z) = fn(Y (e, aj, b;)) for all i € N. 
Since we assume fin (Y (é, aj, b;)) > co asi > ow, it follows that for 
each v € T the set {i e N | v € [a;, b;|} is at most finite (because we 
have v ¢ [aj, b;] whenever minze)q,pjhn(z) > hn(v)). Then the result 
follows from the assertion (2) of Lemma 4.3. 


Lemma 4.14. For every vertex v of a Z"-tree T, the set 
Iy = {(a, 6) € Ends() x Ends(L) | v € La, bJ} 


is open in Ends(I) x Ends(1) with respect to the product topology induced 
by the metric dy. 


Proof. Let (a,b) € Iy and let p € Ends(I) be such that dp(b,p) < 
ds (v, 6). Then v ¢ |p, 6]. However, we have [a,b] © La,p] U Lp, 6] by 
the assertion (e) of Lemma 4.2. This implies that v € |a,p], that is, 
(a, p) € Ip. 

By repeating the same argument we readily see that (r,p) € Iy 
whenever d¢(b,p) < dg(v,b) and dy(a,r) < dp(a,v). This obviously 
implies the lemma. 


Remark 4.15. It can be shown that J, from Lemma 4.14 is also closed. 


5. Proof of Theorem 1.1 


In this section, we prove Theorem 1.1 according to the scheme outlined 
in the introduction. For the rest of this section, we fix n € N and 
a countable nonabelian Z”"-free group G acting minimally, freely, and 
without inversion on a Z"-tree I’. Also, we let (I, dy) denote the compact 
metric space constructed in Subsection 4.1 for IP. Recall that Ends(T) 
denote the set of open ends of I’. We also fix a non-degenerate probability 
measure jp on G. 


5.1. p-proximality of Tf and Ends(T) 
Proposition 5.1. [fv is a w-stationary measure on T then 
(i) v ts continuous, 


(ii) v(Ends(P)) = 1, : 
(iii) v(Z) > 0 for every nonempty open set E CT with ENEnds(T) 42. 


Proof. (i) Continuity of v follows from Lemma 2.2 because the action 
of G on T has no finite orbits (see Theorem 4.8). 
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(ii) The set I’ is countable by assumption. Therefore, Cuts(I’) is count- 
able. Then, since v is continuous, we have v(I') = 0 and v(Cuts(L)) = 0. 
Hence, v(Ends(P)) = 1. 

(iii) From Theorem 4.9 it follows that either G acts on Ends(IT) 
minimally, or there is a global fixed point. If all elements of G fix 
e € Ends(I) then G is abelian and we have a contradiction with the 
set of assumptions on G. Hence, G acts on Ends(T) minimally and now 
(iii) follows from (ii) by Lemma 2.3. 


Define G to be the subset in GN consisting of all sequences {gi} with 
the following three properties: 


(i) {gi -€} converges in (I, de), 
(ii) i - e} converges in (I, df), 
(iii) {in (gi)} tends to infinity as 1 > oo (recall Definition 4.10). 


Proposition 5.2. A.e. path {t;} of the u-random walk contains a 
subsequence {tj,} € G. 


Proof. First, we prove that a. e. path {t;} of the w-random walk contains 
a subsequence with the property (iii) from the definition of G. 

Let d € N. Since G acts on I’ minimally, it follows that there exists 
an element goq € G such that hp(goq) > 2d. Since uw is a nondegenerate 
measure, there exists k € N such that p** (gaa) > 0. It follows that for 
a.e. path {t;} of the w-random walk there exists m € N such that tm+~ = 
Tmg2d (to see this, use ergodicity of the Bernoully shift in the space of 
sequences of p**-equidistributed random variables (tig TiktRLIENo» or 
observe that the measure P, of the set of all paths of the y-random 
walk with tjp4% A tjkg2aq for each 7 € N equals lims+.o(1 — 5)*° = 0, 
where 6 = u** (goa) > 0). Then 


nT Tm+k) = An (Ga) > 2d. 
Now, we apply the triangle inequality (see Lemma 4.11) and deduce that 
max{hin(Tm+k)s hn(tm)} > d. 


We have thus shown that for any d € N, the path {t;} a.s. has an 
element t; such that h,(t;) > d. It obviously follows that {t;} a.s. has a 
subsequence {t;} such that {f,(t;)} tends to infinity (the property (iii)). 

Since (I, df) is compact, each infinite sequence in G (and {r/} in par- 
ticular) contains a subsequence having both properties (i) and (ii). 


Lemma 5.3. Let {g;} be a sequence from G. Let w and @ be points 
from T’ that the sequences {gj - ©} and {g;! -e} converge to. Then, for 
each p€ PT \@, the sequence {g;- p} converges to w. 
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Proof. In order to prove the lemma, we will show that for every p € 
I \ @ we have 


iin (pi) ——> 0, (13.7) 
17> CO 
where pj = Y(€,9i-&, gi: p), and then apply Lemma 4.13. 
Observe that for each i we have 9; -pp= Yee, ge -€,p) because 


9) pig: Y(E,9°E,9 °>DI= Y(g;*-€,8,p) = Y(e,9, + 8,7). 


Let us prove that the sequence {fin(Y (e, 9; - €,p))} is bounded. 
Assume that it is not. Then it has a subsequence {h,(Y (e, 9," -€,p))} 
which tends to infinity. Then, by Lemma 4.13, { 9," -e} converges to p, 
which contradicts the assumption that je -e>@o#&p. Thus, {in(Y 
(€, Ge -€,p))} is bounded, that is, there exists N € N such that 

Rik Y (ede -€,p))< N foreach ieé€N. 


Therefore, for each 1 € N we have 
_ def = = 
hn (Gi -€, Pi) = hale, 9; Pi) = hn(g; Pi) = hn(V (6,9; '+8,p)) < N. 
Now, {gi} € 6, so 
def def 
hin(€, 91-8) = hn(gi-£) = hn (gi) 
tends to co (the property (iii)), while we have 


hn(é, pi) + hn (Wi, 9° ©) = hin€, % - &) 


by the triangle inequality (see Lemma 4.11), whence it follows that 
hn(pj) > oo. Finally, by Lemma 4.13, this implies that {g;-p} converges 
to w since {g; - €} does. 


Lemma 5.3 obviously implies the following corollary. 


Corollary 5.4. Let {gi} be a sequence from G with {g; +e} converging 
to a pointw eT. Then, for any continuous Borel probability measure 
onl’, the sequence {g;-A} converges to the Dirac measure dy. 


Corollary 5.5. If v is a u-stationary measure on T, then (T,v) is a 
p-boundary of (G,u) in the sense of Furstenberg. In other words, the 
G-space T is mean-proximal. 


Proof. Let {tj} be a path of the w-random walk. Then by Theorem 
2.4, the sequence {t; - v} a.s. converges to some limit. By Proposition 
5.2, {t;} a.s. contains a subsequence {ti} € G6. By Corollary 5.4, the 
subsequence {t;, - v} converges to a Dirac measure (since v is continuous 
by Proposition 5.1). Therefore, the limit of {t;-v} is a.s. a Dirac measure. 
This means by definition that ([,v) is a u-boundary of (G,j) in the 
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sense of Furstenberg. Since yw is an arbitrary nondegenerate measure, it 
follows by Theorem 2.10 that the G-space I is mean-proximal. 


Now, we can draw the main result of this subsection from the results 
above. 


Proposition 5.6. There exists a unique -stationary measure v,, on r. 
This measure is continuous and concentrated on Ends(T). The measure 
space (I', vy) 18 a 4-boundary of G in the sense of Furstenberg. 


Proof. Observe that uniqueness of a j-stationary measure v, follows 
from Corollary 5.5 by Theorem 2.10 (the last one is applicable, because 
I is compact and metrizable). The rest follows from Proposition 5.1 and 
Corollary 5.5. 


Proposition 5.6 yields the following corollary (see Assertion 2.7 and 
Corollary 2.8 concerning relationship between p-boundaries in the sense 
of Furstenberg and in the sense of Kaimanovich). 


Corollary 5.7. There exists a unique p-stationary measure vy on 
Ends(T). This measure is continuous. The measure space (Ends(T), v,,) 
is a t-boundary of G in the sense of Kaimanovich. 


5.2. Stability of Paths in Z”-Free Groups 


Proposition 5.8. Let {g;} be a sequence in G, and let z be a point inT. 
Assume that the sequence {9;-V,} converges to the Dirac measure 5,.Then 
all the sequences {g;-v} with v € T converge to z. 


Proof. Let v €T. Since I is countable and admits a minimal action of 
a group, it follows that there exists a pair of ends w1,@2 € Ends(I) such 
that v € (@1,@2) (indeed, otherwise I has closed ends and the subset of 
vertices that are not closed ends forms a proper invariant subtree, which 
contradicts the minimality assumption). 

Lemma 4.14 implies that there exist nonempty open subsets A and 
B in Ends(P) containing respectively w; and w2 such that v € (a,b) 
whenever a € A, b € B. By assertion (iii) of Proposition 5.1 we have 
v,(A) > 0 and v,(B) > 0. Therefore, since {g; - v,} converges (weakly) 
to 6,, it follows that for every r > 0 there exists N > 0 such that 
dp (z, 9; - aj) < r and dr(z, 9; - b;) < r for some a; € A and bj € B 
whenever i > N. 

Note that the balls of the metric dy are geodesically convex. In 
particular, the conditions 


GH VEMG + A, gi- bi), Ugle,gi-ai)<r, and dp(z,q- bi) <7 
imply that dy(z, 9; - v) < r. Indeed, we have 


GV EG: UG, Gi° bi) = Loe Gi, G6 bi] C LZ, 96+ ai] ULZ, gi - bil 
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by Lemma 4.2, so that 
a (z, giv) Sd(z,gi-a)<r if gi-ve lz,q- ail, 


ag(z, gv) Sdg(iz,g- bi) <r if ge ve lz, gi: bi). 


It is thus shown that for any r > 0 there exists N such that df (z, gi-v) 
< r whenever 7 > N, that is, the sequence {g; - v} converges to z. 


Corollary 5.9. For almost every path t = {t;} of the u-random walk, 
all the sequences {t;- v}, where v ET’, converge to a random end w(t) € 
Ends([); the distribution of the limits w(t) ts given by the w-stationary 
MEASUTE Vy. 


Proof. Proposition 5.6 says that the measure space (I',v,) is a 
p-boundary of G in the sense of Furstenberg. This means by definition 
that, for almost every path t = {tj} of the mw-random walk, the 
sequence {t; - v,} converges weakly to a random Dirac measure 64,, 
where z= z(t) ET. 

Then Proposition 5.8 implies that all the sequences {t; - v}, where 
v € T, converge to z(t). By Assertion 2.7, the distribution of the limits 
z(t) is given by the measure v,,. Since v,,(Ends(I’)) = 1 (see Proposition 
5.6), this implies the required assertion. 


5.3. Maximality of the Boundary Ends(I) 


According to Corollary 5.7, the pair (Ends(T), v,,) is a u-boundary of G. 
Now we would like to show that it is a maximal p-boundary. 

Let X be a measurable G-space with a (quasi-invariant) measure v. 
Recall that the action of G on X is called ergodic if v(Y) € {0,1} for 
each G-invariant measurable subset Y Cc X. 


Lemma 5.10 ({15]).. Let G be a countable group, uw € P(G) and jt € 
P(G) the reflected measure defined by fi(g) = w(g—'). Let (My, v4) and 
(M_,v_) be respectively a 4-boundary and a ji-boundary of G. Then the 
action of G on the product (M_ x M,,v_ x v4) 18 ergodic. 


Observe that in [15], Lemma 5.10 is stated and proved in the case when 
(M,,v+) and (M_,v_) are maximal, respectively, - and ji-boundaries. 
But the proof works verbatim for the general case. 


Proposition 5.11. Let G be a finitely generated nonabelian Z" -free 
group acting on a Z"-tree T for some n € N so that T has no proper 
G-invariant subtrees. Let uw be a nondegenerate probability measure on 
G and v, the unique -stationary measure on the space Ends(I°) of open 
ends of [. If u has finite first moment with respect to a finite word metric 
on G then the measure space (Ends(T),v,) 1s a maximal p-boundary 


of G. 
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Proof. In order to prove the theorem, we are going to construct a map 
S: Ends(P) x Ends(r) > 2° 


and then show that it satisfies all the requirements of the Strip Criterion 
(see Corollary 2.11 and Remark 2.12). 

Let a, b € Ends(I). There exists a unique bi-infinite geodesic (a, 6) in 
I from a to b. We define 


S(a,b)={ge Gl g-e€ (a, bj}. 


Obviously, the map S: Ends(l) x Ends(T) > 26 is G-equivariant. 
Claim 0. The map S is measurable in the sense that S~!(M) is 
measurable for any MCG. 
For a vertex v € I’, define 


Qy = {(a, b) € Ends(T) x Ends(L) | v € (a, b)}. 
Our definitions immediately imply that 
S(a,b)={g € G| (a,b) € Qg.6 = 9+ Qe} (13.8) 
so that for each g € G we have 
SAG= 2s 


By Lemma 4.14, for all v € T the sets Q,y are open and hence Borel 
measurable. In particular, the set S~!(g) = Q,-1., is Borel measurable 
for every g € G. Hence, Claim 0 follows by the countability of G. 

Claim 1. Let v,_ be a p-stationary measure on Ends(T) and v_ be 
a jt-stationary measure on Ends(T), where jt is the reflected measure 
of w defined by jt(g) = wg"). Then for (v— xX v4)-a.e. pair (a,b) € 
Ends(() x Ends(), the set S(a, 6) is non-empty. 

It follows from (13.8) that 


S(a, 0) =1¢ 2G |g) (a,b) Qh (13.9) 


Given that T is countable and admits a minimal action of a group 
(while Ends(T‘) 4 @), it follows that Qe is not empty (indeed, otherwise 
I would have closed ends and the subset of vertices that are not 
closed ends would form a proper invariant subtree, which contradicts 
the minimality assumption). 

Lemma 4.14 says that Q, is open with respect to the product topology. 
Therefore, since Q, is not empty, there exist nonempty open sets A, BC 
Ends(T) with A x B C Q,. Applying the assertion (iii) of Proposition 
5.1 to the measures v_ and v,, we see that (v_ x v+)(Qe) > 0. 

Recall that the action of G on the space (Ends(T) x Ends(I), v_ xv) is 
ergodic (Lemma 5.10). Consequently, since (v_ x v_)(Q-) > 0, it follows 
by (13.9) that for (v_ x v4)-a.e. pair (a, b) € Ends(f‘) x Ends(P), the 
set S(a, 6) is nonempty and Claim 1 follows. 
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Now, recall that by assumption G is a finitely generated nonabelian 
group acting freely and without inversions on I’, and F has no proper 
G-invariant subtrees. Let Z be a finite generating set of G and let |-|@¢ 
be the word metric on G with respect to Z. 

To finish the proof of the theorem we are going to prove the following 
claim. 

Claim 2. For any m € {1,...,n} there exists C(m) € N such that 
for any mazimal Z™-subtree T of T, and any distinct a,b € Ends(T) 
we have 


card({g¢ G|g-e€ (a,b)}N Bg(k)) < C(mk™, 
where BG(k) ={9 € G | lgla < k}. 


Fix a maximal Z’’-subtree T and assume that there exists at least 
one g € G such that g-e € (a,b), otherwise the claim holds trivially 
for T. Define H: = Stabg(T). Since G is finitely generated, H is also 
finitely generated, and it follows that H is quasi-isometrically embedded 
into G (see Subsection 3.2). Let R be a finite generating set of H and 
let |-|4 be the word metric on H with respect to R. 

Denote by T,m the maximal Z™-subtree of P containing ¢. In order 
to prove the claim we use induction on m. 

If m= 1 then T is a Ztree and (a, b) is isometric to Z. Consider two 
cases. 


(1.1) Assume that T = T,,1, that is, e € T. If g € G is such that 
g-é € (a,b), then g € A and we have 


{ge Glg-e€ (a,b}={(g9€H|g-€ € (a, b)}. 


Next, observe that H is a finitely generated free group which 
quasi-isometrically embeds into T by means of the map h > h-e 
for any he H. 

Now, let g € H be such that g € Beak), that is, |gla < k. 
Since H quasi-isometrically embeds into G, it follows that |g|q7 < 
C\k + Cy for some constants C1, Co € N which depend only on 
Hand G. Then, it follows that 


dr(e,g-&) < C3(Cik + Ch) 4+ Ch < Ck, 


where dr is the distance on T inherited from T and C’ depends 
only on H and T. But now observe that since (a, b) is a linear 
subtree of 7, the number of points on (a,b) at distance not 
greater than C’k from ¢ is bounded by 2C’k + 1. At the same 
time, the action of H on T is free, so, the number of elements 
g € H such that g € Ba(k) and g-« € (a, b) cannot be greater 
than 2C’k + 1. It means that the required inequality 


card ({g € G| g-e € (a, b)} Ba(k)) < C(Te1)k 
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holds for C(T¢,1) = 2C’ +1. 
(1.2) Assume that T # T,1. Observe that if 


gp E{geGl|g-e€ (a,b)}N Belk) 
then a. -a and % - 6 are distinct ends of T,,1 and we have 


card ({g € G| g-e € (a, b)}N Batk)) 


< card ((9 EGlg-e€ (g-a,g'-b)}N Bo(2k)) 


Indeed, for every g, € {9g € G| g-e € (a, b)}N Beak), the element 
9 91 belongs to Bg (2k) since 


loo ‘gle <lgla +Igile < 2k, 


and 


(9991) °€ € (99° 4.99" * b)- 
In other words, gy ‘91 € {9 € Gl g-€ € (gp +4, 99. b)}NBG(2k) 
and the required inequality follows. Finally, from (1.1) we have 
that 


card ((geG | g-€€ (91+ a, 951 + 0)}M Bo(k)) < C(Tea)-2h). 
That is, C(T) = 2C(T.,1). 


Hence, the constant C(1), which works for every maximal Z-subtree 
of I can be taken to be 2C(T,1). This concludes the proof of the claim 
in the case when m = 1. 

Suppose m > 1 and assume that Claim 2 holds for m — 1. 

Recall from Subsection 3.2 that one can define the metric d»_1 on the 
set Em—1(T) of all maximal Z™~!-subtrees of T and dm_1 takes values 
in Z. Consider two cases. 


(m.1) Assume that T = T; m, that is, e € T. In particular, T contains 
Ts,m—1- 
Observe that (a,b) may have nonempty intersection with 
at most countably many Z’~!-subtrees of T, which we can 
enumerate by integers 


Perera! Gitar re eereNes Aiai nd 4 gy! fy Ieereere Acree 


wh 


where 7p is the ‘closest’ one to T.,m—1 with respect to the metric 
dm—1.- 

There exists a constant M > 0 such that for every h € {g € 
G|g-e€ (a,b)}N Bek) we have 


dm—1(Te,m-1; h- Ts,m—1) < Mk. 
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Indeed, because h-e € (a, 0), it follows that h ¢ H. Next, since H 
quasi-isometrically embeds into G, from |h|g < & it follows that 
lhly < Cik + Cy for some constants Ci, Co € N which depend 
only on H and G. Now, dm-_i satisfies the triangle inequality, 
so if 
[= max dm—1(Ts,m-1, hy Ts,m—1) 
hieR 
then 
dm—1(Te,m—1, + Ts;m—1) < Ls (Cik + C2) 


and existence of the required M follows. 
Let 


Th: ={S €{Ti} | dm-10S, Tepm—1) < Mk}. 


Obviously, card(7;,) < 2Mk+1 and foreveryhe{geG|g-e€ 
(a, b)}O Be(k) there exists S € 7, such that h-e € (a, b)ANS = 
(ag, 0g), where ag, bg € Ends(S). Thus, we have 


card ({g € G| g-@ € (a, b)}N Batk)) 


< DO card({g€ G|g-e € (ag, bs)} MN Ba(k)) 
SeT, 


< (2Mk + 1)(C(m — )k™}) < C(Tem)k™, 


where C'(T.,m) is a constant which depends on Tym and Z, but 
not on a and 6. 
(m.2) Assume that T # T,m, that is, e ¢ T. 
If 


9 €{ge Gl g-e€ (a, b)}N Belk) 
then using the same argument as in (1.2) we can show that 


card ({g € G| g-@ € (a, b)}N Batk)) 


< card (tg EG|g-e€ (91-4, 951+ BN Bo(2k)) 


Then from (m.1) we obtain 


card (t9 EG|g-e€(gt-a,gt-by}n Bo(2k)) 
< C(Te,m) > (2k)™. 
That is, C(T) = 2™C(Te,m). 


By setting C(m): = 2™C(T,;m) we finish the induction step and 
Claim 2 is proved. 
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Abstract 

In this chapter we survey the theory of random walks on buildings and 
associated groups of Lie type and Kac—Moody groups. We begin with 
an introduction to the theory of Coxeter systems and buildings, taking a 
largely combinatorial perspective. We then survey the theory of random 
walks on buildings, and show how this theory leads to limit theorems for 
random walks on the associated groups. 
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Introduction 


Probability theory on real Lie groups is a classical area, with beautiful 
results obtained in the 1960s, 1970s, and 1980s (see, for example 
[5, 22, 23, 29, 48, 57, 58]). It is the purpose of this chapter to survey 
more recent results dealing with probability theory on groups of Lie type 
defined over other fields, and extensions of these results into the setting 
of Kac—Moody groups. A unifying feature of these works is the use of a 
combinatorial/geometric object called the building of the group, which 
in some ways plays a role analogous to the symmetric space of a real Lie 
group. In fact, the building becomes the main object of interest, and so 
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this survey is really about random walks on buildings, with applications 
to random walks on the associated groups. 

Buildings were invented by Jacques Tits in the 1950s in an attempt 
to give a uniform geometric interpretation of semi-simple Lie groups. 
He achieved this goal spectacularly by classifying the class of irreducible 
thick spherical buildings of rank at least 3, showing that this class of 
buildings is essentially equivalent to the class of simple linear algebraic 
groups of relative rank at least 3, simple classical linear groups, and 
certain related groups called groups of mixed type (see [49]). Since their 
invention the scope of building theory has expanded immensely, with 
affine buildings playing an important role in the study of Lie groups 
over p-adic fields, and twin buildings utilised extensively in the theory 
of Kac—Moody groups. 

For the purpose of this introduction a building consists of a set A 
(whose elements are called chambers) and a way of measuring distance 
between chambers. This measurement is not simply a numerical distance, 
instead the distance between two chambers is an element of a Coxeter 
group W associated to the building. Thus, there is a ‘W-valued distance 
function’6: Ax A > W, satisfying various axioms making A into a kind 
of ‘W-metric space’ (see Definition 2.1 and Remark 2.3 for more details). 

Buildings arise naturally in connection with groups originating in 
Lie theory, and the axioms satisfied by 6 are in essence capturing 
the combinatorics of the ‘Bruhat decomposition’ in these groups. More 
precisely, if G is a group with a Tits system (B,N,W,S) then setting 
A = G/B and 6(gB,hB) = w if and only if g-'h € BwB produces 
a building (see Section 2.2 for details). While this connection to group 
theory is the raison d’étre for buildings, there are many buildings that 
are not associated in any nice way to groups (see, for example, [43]). 
This motivates the philosophy of treating the building, rather than the 
group, as the primary object of interest. 

We will give an introduction to the theory of buildings in the first two 
sections of the chapter, with Section 1 devoted to the theory of Coxeter 
groups, a necessary prerequisite to the theory of buildings. Section 2 is 
devoted to the buildings themselves and to the related group theoretic 
notion of a Tits system in a group. We will focus on the classes of 
buildings on which random walks have been studied, including: 


(1) The spherical buildings, where W is a finite reflection group. By 
Tits’ classification [49] these buildings are closely related to groups 
of Lie type such as SZ,,(F) where F is a field. 

(2) The affine buildings, where W is an affine reflection group. By the 
Tits—Weiss classification [52, 59] these buildings are closely related 
to groups of Lie type defined over fields with discrete valuation, 
such as SLy(Qpy) or SLy(K(¢t)). The simplest affine buildings are 
trees with no leaves, for example, homogeneous trees. 
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(3) The Fuchsian buildings, where W is generated by reflections in the 
hyperbolic disc H?. These buildings do not admit a classification 
(see Section 2.5), however some of them are related to certain 
‘Kac—Moody groups’. 


In Section 3 we survey results on random walks on buildings and 
associated groups. There are various types of random walks that we 
will consider. A particularly neat class consists of the isotropic random 
walks on the chambers of a building. These are the random walks 
(Xn)n>0 on the set A of chambers such that the transition probabilities 
p(2, y) depend only on the W-distance 6(z, y). These random walks arise 
naturally from bi-B-invariant probability measures on groups admitting 
Tits systems, and any limit theorems established for the random walks 
on the building imply limit theorems for these measures. 

In Section 3.1 we outline the beautiful algebraic theory of isotropic 
random walks on A. Put briefly, the transition operator of an isotropic 
random walk is an element of an algebra called a Hecke algebra. These 
algebras have been extensively studied, (largely due to their connections 
with groups of Lie type and p-adic Lie groups), and their representation 
theory plays a key role in the theory of random walks on buildings. 

In Section 3.2 we specialise to the case of finite spherical buildings. In 
this case the Hecke algebra is finite dimensional, and we give an overview 
of how the representation theory of this algebra can be applied to inves- 
tigate isotropic random walks. In particular we provide tractable upper 
bounds for mixing times for isotropic walks on finite spherical buildings. 
The analysis follows, in spirit, the work of Diaconis and Ram [18] where 
the representation theory of finite dimensional Hecke algebras is applied 
to investigate the systematic scan Metropolis algorithm. It turns out 
that this theory is related to random walks on spherical buildings, and 
so Section 3.2 is really a translation of [18] into the language of buildings. 
Other works related to random walks on spherical buildings can be found 
in Brown [8, 9] and Brown and Diaconis [7]. 

Next we consider random walks on affine buildings. In this context it 
is also natural to consider random walks on the ‘vertices’ of the building 
(these walks arise from bi-K-invariant measures on p-adic Lie groups, 
where K is a maximal compact subgroup). We will survey results on 
these random walks, and random walks on associated groups, drawing 
from the works of Cartwright and Woess [14], Lindlbauer and Voit [33], 
Parkinson [38], Parkinson and Schapira [39], Parkinson and Woess [41], 
Schapira [46], Tolli [54] and Trojan [55]. These works include precise 
limit theorems for isotropic random walks on the vertices and chambers 
of affine buildings, as well as theorems for random walks on groups 
associated to these buildings. Hecke algebras again play a key role in the 
analysis. Homogeneous trees are the simplest examples of affine buildings 
(the ‘rank 2 case’), and in this direction we mention the fundamental 
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works of Cartwright, Kaimanovich and Woess [13], Lalley [82] and 
Sawyer [45]. The literature relating to probability theory and harmonic 
analysis on homogeneous trees is extensive, and here we will focus on 
the higher rank cases. 

The study of random walks on non-spherical, non-affine buildings 
is very open territory. In Section 3.4 we survey recent results of 
Gilch, Miiller and Parkinson [21] concerning isotropic random walks on 
Fuchsian buildings. In this context a law of large numbers and a central 
limit theorem are available, with interesting formulae for the speed and 
variance in terms of an underlying automatic structure related to the 
building. 

We conclude our survey by listing some future directions in the 
theory, and providing some appendices. In the first appendix we carry 
through a ‘by-hand’ computation outlining the general theory of isotropic 
random walks on the vertices of affine buildings in the special case of 
Co buildings. In this basic case we can minimise some of the heavy 
(although beautiful) machinery used for the general case, thus making 
the analysis more accessible. In the second appendix we outline the 
representation theory of rank 2 spherical Hecke algebras, and show how 
a precise knowledge of the representation theory allows for accurate 
mixing time estimates for random walks on generalised polygons (that 
is, rank 2 spherical buildings). As a byproduct we recover a proof of 
the celebrated Feit-Higman theorem (this approach is due to Kilmoyer 
and Solomon [31], and is, in turn, an adaptation of Feit and Higman’s 
original proof from 1964 [19]). 

On a personal note, it is an absolute pleasure to dedicate this chapter 
to my friend and collaborator Wolfgang Woess on the occasion of his 
60th birthday. Wolfgang’s tireless support of young mathematicians has 
been a true gift to the mathematical community, a gift from which I have 
greatly benefited. 


1. Coxeter Systems 


Coxeter systems form the backbone of the higher objects of buildings 
and groups of Lie type. In this section we recall some basic theory of 
Coxeter systems, focussing on examples and important classes. Standard 
references include [1, 4, 26]. 


1.1. Definitions 


Definition 1.1. A Cozeter system (W,S) is a group W generated by 
a finite set S with relations 


s?=1 and (st)™*=1 foralls,te S withs At, 


where mst = mts € Zsa U {oo} for all s A t (if mst = 00 then it is 
understood that there is no relation between s and t). We sometimes 
say that W is a Coxeter group when the generating set S is implied. 
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Let (W, S') be a Coxeter system. The rank of (W, S$) is |S|. The length 
of we W is 


£(w) = min{n > 0| w= s1--- Sn with 51,...,5n € S}, 


and an expression w = $1--- 8, with n minimal (that is, n = €(w)) is 
called a reduced expression for w. It is useful to note that if w € W and 
s € 5 then €(ws) = €(w) +1. In particular, €(ws) = €(w) is not possible. 

For each I C S the standard I-parabolic subgroup of W is the subgroup 
W, = ({s | s € I}). Then (W7, J) is a Coxeter system, and hence Coxeter 
systems ‘contain’ other Coxeter systems of lower rank. This fact is very 
important in the theory, facilitating inductive arguments on the rank of 
the group. This makes the rank 2 systems particularly important as the 
base case. The rank 2 Coxeter group W = (s,t | s? = t? = (st)™ = 1) 
is just the dihedral group of order 2m (or the infinite dihedral group if 
m = oo), and is denoted by Ip(m). 

The data required to define a Coxeter system is conveniently encoded 
in a graph '(W,S) with labelled edges called the Coxeter graph. This 
graph has vertex set S, and vertices s,t € S are joined by an edge if and 
only if ms¢ > 3. If ms => 4 then the corresponding edge is given the label 
mst (thus edges with no label have m,; = 3, and if s and ¢ are not joined 
by an edge then ms = 2). A Coxeter system (W, S) is called irreducible 
if the Coxeter graph '(W, S) is connected. Note that if [(W, S$) is not 
connected, and if S = $1; U---U S; is the decomposition into connected 
components, then W is the direct product of parabolic subgroups W = 
Ws, x --- x Wg,. Thus, irreducibility is a natural assumption to make 
in the theory of Coxeter systems. 


1.2. The Coxeter Complex and Examples 


The Coxeter complex of a Coxeter system is a natural simplicial complex 
on which the Coxeter group acts, and plays an important role in the 
general theory. 

Recall that a simplicial complex with vertex set V is a collection ¥ 
of finite subsets of V (called simplices) such that for every v € V, the 
singleton {v} is a simplex (called a vertex), and every subset of a simplex 
o isa simplex (a face of o). If o is a simplex which is not a proper subset 
of any other simplex then o is a chamber of DB. 

Let = can be simplicial complex, and let < be the face relation (that 
is, o’ < o if and only if o’ is a face of o). Then (%, <) is a partially 
ordered set satisfying: 


(P1) For each pair 0,0’ € & there exists a greatest lower bound o No’. 
(P2) For each o € & the poset {o’ | o’ < o} is isomorphic to the poset 
of subsets of {1,2,...,r} for some r. 


On the other hand, any partially ordered set (X,<) satisfying (P1) 
and (P2) can be identified with a simplicial complex © by taking the 
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vertex to be the set V of all elements v € & such that r = 1 in (P2), 
and identifying each element o € © with the simplex {uv € V |v <o}. 


Definition 1.2. Let (W,S) be a Coxeter system. The Coxeter complex 
x(W,S) is the simplicial complex constructed as above from the poset 
of all cosets of the form wWy7 with w € W and I C S, ordered by reverse 
inclusion (we emphasise the reverse inclusion here: wW; < vW, if and 
only if wW; > vW,). 


Explicitly, the vertex set of the simplicial complex &(W, S) is 
V= {wWs\¢s} |we W,se€S}, 


and co = {Ws\{s} | s € S} is a chamber. The set of all chambers is 
{wep | w € W}. We have that wcp = vco if and only if w = v, and so the 
set of all chambers can be identified with W by wcg @ w. Each chamber 
has exactly |.S| vertices (namely, the chamber w has vertices wWs\{s} 
for s € S). The Coxeter complex comes equipped with a natural type 
function t : X(W,S) > 2° given by t(wWy) = S\I. Thus, the vertex 
wWs\ts has type s (more accurately, type {s}), and each chamber has 
exactly one vertex of each type. 


Example 1.3. Let (W,5S) be the dihedral group of order 6 with S = 
{s,t}. Write W, = Wis} = {1,8}, and similarly for W;. The Coxeter 
complex &(W,S') has six vertices, marked in the diagram below by e 
(vertices of type s) and o (vertices of type t). Each chamber has |S$| = 2 
vertices, and thus chambers are represented as edges in the diagram. 
Similarly, the Coxeter complex of a dihedral group of order 2m is a 2m- 
gon, and the Coxeter complex of the infinite dihedral group is a two 
sided infinite path with alternating vertex types. 


Wi 1 Ws 
tW; sWi 


tsW; stW, 


A Coxeter system is called: 


) spherical if |W| < oe, 

(2) affine if W is infinite and contains a normal abelian subgroup Q 
such that W/Q is finite, 

(3) Fuchsian if W is generated by the reflections in the sides of a 

polygon in H?. 
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(a) The Az Coxeter complex (b) The Bz (or C3) Coxeter complex 


Figure 14.1 Examples of rank 3 spherical Coxeter systems 


If (W,S) is an irreducible spherical Coxeter system then W can be 
realised as a group generated by linear reflections in E = R!*!. The 
action of W decomposes F into |W| geometric cones based at the origin, 
and by intersecting these cones with the unit sphere we can visualise 
the Coxeter complex &(W,S) as a tessellation of the (|S| — 1)-sphere 
(some examples are illustrated in Figure 14.1). There is a well-known 
classification of the irreducible spherical Coxeter systems due to Coxeter 
[15]. The nomenclature of this classification has its origins in the Cartan- 
Killing classification of simple Lie algebras over C. The list of spherical 
Coxeter systems is as follows (in each case the subscript denotes the rank 
of the system, see Figure 14.10 in Appendix C for the Coxeter graphs). 


(1) Crystallographic systems: An (n > 1), Bn = Cn (n = 2), Dn 
(n > 4), Ee, Ev, Eg, Fs, Go. 

(2) Non-crystallographic systems: H3, H4, Ilb(m) (with m = 5 or 
m > 7). 


If (W,S) is an irreducible affine Coxeter system then W can be 
realised as a group generated by affine reflections in E = RI*!-1 
(see Section 1.3). The action of W decomposes FE into geometric 
simplices, and so the Coxeter complex &(W,S) may be visualied as 
a tessellation of R'S!-1 (some examples are illustrated in Figure 14.2, 
the additional information in the figure will be explained in Section 1.3). 
The classification of irreducible affine Coxeter systems is closely related 
to the classification of irreducible spherical Coxeter systems. Specifically, 
to each irreducible crystallographic spherical Coxeter system (of type Xn, 
say) there is an associated affine Coxeter system of type Xp, obtained 
by adding one additional generator to the spherical system. In the case 
of spherical systems of type By, = Cy, there are two associated affine 
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Figure 14.2 Rank 3 affine Coxeter systems, and associated (dual) root systems 


systems, called Bn and Cx, and these are non-isomorphic if n > 2. See 
Figure 14.11 in Appendix C for the Coxeter graphs of the irreducible 
affine Coxeter systems. 

If (W,S) is a Fuchsian Coxeter system then W may be realised as a 
group generated by reflections in the sides of a polygon in the hyperbolic 
disc H?, and thus W is a cocompact discrete subgroup of PGL2(R) 
(hence the term ‘Fuchsian’). Specifically, let n > 3 be an integer, and let 
ky,...,kn = 2 be integers satisfying 


el 
yi a <n-2. (14.1) 
— ki 
i=1 
Assign the angles 2/k; to the vertices of a combinatorial n-gon F’. There 
is a convex realisation of F (which we also call F’) in the hyperbolic 
disc H?. Let W be the subgroup of PGL2(R) generated by the set S of 
reflections in the sides of F. Then (W, S$) is a Coxeter system (see [16, 
Example 6.5.3]), and if s1,...,5, are the reflections in the sides of F 
(arranged cyclically), then the order of s;s; is 

a (14.2) 

co if ji—gl> 1, 

where the indices are read cyclically. We denote this Coxeter system by 
F(k,...,kn). The group W acts on H? with fundamental domain F, 
and thus induces a tessellation of H? by isometric polygons wF', w € W. 
Examples are shown in Figure 14.3. We note that this is not a depiction 
of the Coxeter complex of these groups unless |S| = 3. For example, 
each chamber of the Coxeter complex of the group represented in 
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(a) Fuchsian Coxeter system F'(3, 3, 4) 


Figure 14.3 Fuchsian Coxeter systems 


Figure 14.3(b) is a |S'|— 1 = 5 dimensional simplex. Instead the pictures 
are (essentially) the Davis complex of the group (see [1, Example 12.43]). 
We will not go into further details, however we simply remark that this 
is a much more convenient way to visualise Fuchsian Coxeter systems 
(and their buildings). 


1.3. The Coxeter Complex of an Affine Coxeter System 


For our later discussions it is necessary to have a concrete description of 
the Coxeter complex of an affine Coxeter system in terms of root systems. 
The standard reference for this theory is [4]. Let E be a d-dimensional 
real vector space with inner product (-,-). The hyperplane orthogonal to 
the vector a € £\{0} is Hy = {x € E | (x,a) = O}, and the reflection in 
Hy is given by sy(x) = x — (x,a)aY where aY = 2a/(a,a). 

A root system in E is a finite set R of non-zero vectors (called roots) 
such that: (1) R spans EF, (2) ifa € R and ka € R then k = +1, (8) if 
a,B € R then sy(B) € R, and (4) if a, B € R then (a, BY) € Z. The rank 
of Ris d=dim(F). Figure 14.4 illustrates three rank 2 root systems. 

Let R be arank d root system. There exists a subset {a1,...,a@q} CR 
of simple roots with the property that every a € R can be written as a 
linear combination of a@1,...,@q with integer coefficients which are either 
all nonpositive, or all nonnegative. Those roots whose coefficients are all 
nonnegative are called positive roots (with respect to the fixed chosen 
set of simple roots), and the set of all positive roots is denoted Rt. Then 
R=RtUCR?*). 

The Weyl group of R is the finite subgroup Wo of GL(E) generated 
by the reflections sy with a € R. For each i = 1,...,d write 3; = 5g,, 
and let S9 = {s1,..., Sq}. Then (Wo, So) is a spherical Coxeter system, 
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(a) The Ay root system (b) The Cy root system (c) The Gg root system 


Figure 14.4 The irreducible rank 2 root systems 


with the order of s;s; being my, where 7 — 2/mj is the angle between 
a; and a;. The Coxeter system (Wo, So) is irreducible if and only if the 
root system R is irreducible (where the latter means that there is no 
partition R = R; U Rz with R; and Rz nonempty such that (a, 6) = 0 
for alla € Ry and all B € Ro). 

The irreducible root systems admit a complete classification, and 
explicit descriptions of each system can be found in [4, Plates I-IV]. 
They fall into four infinite families Ag (d > 1), Bg (d => 2), Cg (d = 2) 
and Dg (d > 4), and 5 exceptional types E¢, 7, Eg, F4 and Go. If R is 
an irreducible root system of type Xq then the Coxeter system (Wo, So) 
is also of type Xq, and hence every irreducible crystallographic spherical 
Coxeter group can be realised as the Weyl group of an irreducible root 
system. 

For each a € R and each k € Z let Ay , = {x € E | (z,a) = k}. Thus, 
the affine hyperplane Hy x is a translate of the linear hyperplane Hy = 
Hy,9. The orthogonal (affine) reflection in the hyperplane Hy, is given 
by the formula sy 4,(z) = 2—((t,a)—k)a” for « € E, and the affine Weyl 
group of R is the subgroup Wag of Aff(Z) generated by all reflections 
Sy,k witha € Rand k ¢€ Z. The root system R has a unique highest root 
gy (the height of the root a = aja, +---+ agaq is a1 +---+ aq, and g is 
the unique root of greatest height). Let s9 = sy1. Then Wag is generated 
by Sap = {S0, 81,.--, Sq} and the order of sos; is mo;, where 2 — 1/moj; 
is the angle between —g and a;. Thus, (Wag, Saft) is a Coxeter system, 
and (Wo, 59) is a parabolic subsystem of (Wag, Sag). For 4 € E let 
4 € Aff(E) be the translation 4 (x) = x+A. Since sy 4 = thay Sa we have 


Wat = Q» Wo, where Q=Zay+---+ Za; is the coroot lattice. 


Thus, (Wag, Saft) is an affine Coxeter system. All irreducible affine 
Coxeter systems arise in this way. 

Let @1,...,@q € E be the dual basis to a1,...,a@q, given by (@;, aj) = 
6;,;. The coweight lattice P of R is 


P={eEE|(A,a) € Z for alla € R} = Zo, 4+ ---+ Zag. 
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Note that Q C P, and that Q and P are both Wag-invariant lattices. 
The set of dominant coweights is 


Pt =No) +---+Nogq. 


The family of hyperplanes Hy ,, a € R, k € Z, tessellates E into 
d-dimensional geometric simplices (the chambers). The extreme points 
of the chambers are vertices, and each chamber has exactly d+1 vertices. 
The resulting simplicial complex is isomorphic to the Coxeter complex 
xu( Wat, Safe). The affine Weyl group Wag acts simply transitively on the 
set of all chambers. The fundamental chamber is 


co= {xe E'| (x, a;) > 0 for 1 <i < d, and (z,¢) < 1} 


and we often identify W. with the set of chambers by w = weg. The 
set P of coweights is a subset of the set of all vertices of U( Wag, Sag), 
called the special vertices. The set Q is the set of type 0 vertices, and 
every chamber has exactly one type 0 vertex. 

The action of Wo decomposes F into |Wo| geometric cones based at 
the origin, and the translates of these cones by elements of P are called 
sectors. The fundamental sector is 


so ={x € E | (x,a;) => 0 for all 1 <i < d}. 


We also write Et = so (roughly speaking, we write s9 when we are 
interested in the simplicial structure, and E+ when we are interested in 
the metric structure). 

Examples of the above construction of affine Coxeter systems are 
illustrated in Figure 14.2, where the sector s9 and the chamber cg are 
shaded (light and dark, respectively). The coroot lattice Q is indicted 
with heavy dots. In Figure 14.2(a) all vertices are special, while in Figure 
14.2(b) only the vertices of valency 8 are special. 


2. Buildings 


Buildings were introduced by Jacques Tits in the 1950s. “The origin of the 
notions of buildings and BN-pairs lies in an attempt to give a systematic 
procedure for geometric interpretation of the semi-simple Lie groups and, 
in particular, the exceptional groups’ [49, Introduction]. Over the past 60 
years the theory has grown immensely, and has had diverse applications 
in geometry, group theory, representation theory, and geometric group 
theory. In this section we give a brief introduction to the theory, with 
our main references being [1, 42, 49]. 


2.1. Definitions and Basic Properties 


Buildings are defined axiomatically, and historically there have been 
two main approaches to the theory, both due to Jacques Tits. The 
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initial approach was via simplicial complexes, and later an approach was 
developed using ‘chamber systems’ (see [51] for an enlightening historical 
discussion). Both approaches are relevant and useful, however here we 
have chosen to adopt Tits’ original simplicial complex definition (see 
Remark 2.3 for the other approach). 


Definition 2.1. Let (W,S) be a Coxeter system with Coxeter complex 
x(W, S). A building of type (W, S') is a nonempty simplicial complex ¥ 
with a family A of subcomplexes (called apartments) such that 


(B1) each apartment A € A is isomorphic to the Coxeter complex 
x(W,S), 

(B2) given any two simplices of © there is an apartment A € A 
containing both of them, and 

(B3) if A, A’ € A are apartments containing a common chamber then 
there is a unique simplicial complex isomorphism yw : A’ > A 
fixing each simplex of the intersection AN A’. 


Let & be a building of type (W,S). The rank of & is |S|. Fix, once 
and for all, an apartment of © and identify it with &(W,S). Thus, 
we regard X(W,S) as an apartment of ©, the ‘standard’ (or ‘base’) 
apartment. The type function on the Coxeter complex ©(W, S$) extends 
uniquely to a type function t : & > 2° on the building making © into a 
labelled simplicial complex. The isomorphism in (B3) is then necessarily 
type preserving. Let A be the set of all chambers of &. 


Example 2.2. Buildings of type Ay = Ip(co) are equivalent to trees 
in which every vertex has valency at least 2. The apartments are two 
sided infinite geodesics in the tree, and the chambers are the edges of 
the tree. There are two types of vertices, indicated by e and o in the 
picture. Buildings of higher rank are considerably more sophisticated 
objects, although the tree example is very instructive. 


The dimension of o € & is |o| —1, and the codimension of 0 € X 
is |S | — |o|. Each chamber of © has dimension |S| — 1 (that is, has |$| 
vertices). A panel of © is a codimension 1 simplex. Chambers x,y € A 
are s-adjacent (written « ~s y) if and only if they share a panel of 
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type S\{s} (that is, if either c = y or x y is a panel of type S\{s}, or 
equivalently, if either z = y or x\y is a vertex of type s). 

For example, in a rank 3 building the chambers are triangles with the 
three edges (panels) of the triangle corresponding to the three types of 
adjacency. Chambers are ‘glued together’ along their s-edges if and only 
if they are s-adjacent. In a rank d building the chambers are (d — 1)- 
simplices, and are glued together along their panels (codimension 1 
faces). An alternate way to visualise a higher rank building is to imagine 
each chamber as a d-gon, with the sides in bijection with S, and these 
polygons are glued together along their edges according adjacency. This 
is particularly useful when (W,S) is Fuchsian. 


AZ 


A gallery of type (s1,...,5n) € S” joining x € A to y € A is a sequence 
20, X1,-.-,2%n € A of chambers such that 


L=M~s, M1 ~sQ °° ~s, In =Y With 2-1 Aa; foralll <j <n. 


This gallery has length n. 

The W-distance function 6: A x A — W on A is defined as follows: 
If z,y € A and if there is a minimal length gallery of type (s1,..., $n) 
from x to y, then let 


6(2, y) = $1$2°++ Sn. 


This does not depend on the particular minimal length gallery chosen. 
A building is called thick if |{y ¢ A | xa ~s5 y}| = 2 for all chambers 
zé€Aand alls eS, and thin if |{ye A| xz ~s y}| = 1 for all chambers 
xz € A and all s € S. It is clear that the Coxeter complex &(W,S) is 
a thin building of type (W,S), and that all thin buildings are Coxeter 
complexes. Typically we are interested in thick buildings. 
A building is regular if for each s € S the cardinality 


ds =|l{ye A|x~s y}| is finite and does not depend on x € A. 


All locally finite thick buildings whose Coxeter group (W, S$’) has ms < 
oo for all s,t € S are necessarily regular (see [36, Theorem 2.4]). Here 
locally finite means that |{y € A | «© ~s y}| < o for all « € A and 
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s € S. The numbers (qs)seg are called the thickness parameters (or just 
the parameters) of the (regular) building. 
For each x € A and each w € W let 


Aw(2) = {y € A | 6(z,y) = w} be the sphere of radius w centred at x. 


If (A, 6) is regular, then by [36, Proposition 2.1] the cardinality qu = 
|A~w(z)| does not depend on z € A, and is given by 


dw = 4s, °** Ys, Whenever w = 51 --- Ss; is a reduced expression. 


The adjectives ‘spherical’, ‘affine’ and ‘Fuchsian’ from Coxeter systems 
carry over to buildings. Thus, the apartments of spherical, affine, or 
Fuchsian buildings are tessellations of a sphere, Euclidean space, or 
hyperbolic disc, respectively. 


Remark 2.3. Let & be a building of type (W,S) with chamber set A 
and Weyl] distance function 6: A x A > W. It is not hard to see that 
the pair (A, 4) satisfies the following: 


(B1) (a, y) = 1 if and only if « = y. 
(B2)' If 6(z,y) = w and z € A satisfies d(y,z) = s with s € S, 


then 6(z,z) € {w, ws}. If, in addition, €(ws) = €(w) + 1, then 
b(2, 2) = ws. 

(B3)’ If 6(z, y) = w and s € S, then there is a chamber z € A with 
d(y, Zz) = s and d6(a,z) = ws. 


Conversely, suppose that we are given a set A and a function 6: Ax A > 
W satisfying (B1)’, (B2)’, and (B3)’. For each J C S and each z € A let 
Rr(z) = {y € A | (a, y) € Wy}. The poset (2, <) of all sets of the form 
Ry(«) with I C S and « € A (ordered by reverse inclusion) satisfies 
conditions (P1) and (P2) from Section 1.2, and hence we may regard 
x as a simplicial complex. It turns out that this simplicial complex is 
a building of type (W, S$) (the most challenging thing to check is the 
existence of apartments). This gives a second approach to buildings: 
Specifically one can take a building of type (W,S) to be a pair (A, 64) 
where A is a set and é6: Ax A > W is a function satisfying (B1)’, 
(B2)’ and (B3)’. See [1, 51] for further details. A certain fluency in both 
approaches is useful when working with buildings. 


2.2. Buildings and Groups 


The group theoretic counterpart to a building is the notion of a Tits 
system in a group. This concept has been very influential in group theory 
due to the existence of Tits systems in many ‘Lie theoretic’ groups, 
facilitating a uniform treatment of these groups. We will see that every 
Tits system gives rise to a building, however not every building results 
from a Tits system. 
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Definition 2.4. A Tits system in a group G is a quadruple 
(B,N,W,S) where B and N are subgroups of G, and (W,S) is a 
Coxeter system, and the following axioms are satisfied: 


(T1) The group G is generated by BU N. 

(T3) The group H = BNN is anormal subgroup of N, and N/H = W. 

(T3) If ns € N maps to s € S$ under the natural homomorphism of NV 
onto W then for all n € N we have BnBn,B C BnBU Bnn,B. 

(T4) With ns as above, nsBn>! # B for all s € S. 


Since H is a subgroup of B there is no harm in writing wB in place of 
nB whenever n € N maps to w € W under the homomorphism of N 
onto W, and we will do so throughout. 


The axioms of a Tits system may appear as foreign to the reader as 
the axioms of a building! Thus, we pause to mention some important 
classes of groups that admit Tits systems. To begin with, the Chevalley 
groups and twisted Chevalley groups admit natural Tits systems, with 
the associated Coxeter systems being of spherical type. For excellent 
treatments of this theory, see [11, 47]. For readers familiar with Chevalley 
groups, the Tits system is as follows. Let R be an irreducible root system, 
and let G(F) be the Chevalley group of type R over the field F. Recall 
that G(F) is generated by elements z(t) with a € R and t € F. Let 


Ne (t) = tq (t)t-o(—t")aq(t) and — hqv (t) = ne (t)na(—1) 


fora € R and t € F*. Let N (respectively H) be the subgroup of G(F) 
generated by the elements ng(t) (respectively hav (t)) with a € R and 
t € F*. Let U be the subgroup of G(F) generated by the elements 2 (t) 
with a ¢ Rt and ¢ € F, and let B = (U,H). Then (B,N, Wo, So) is a 
Tits system in G(F). 

When the field F has a discrete valuation Iwahori and Matsumoto [27] 
discovered that the Chevalley group G(F) admits another Tits system, 
this time with Coxeter group being the affine Weyl group Wag. Examples 
of fields with discrete valuation include the p-adic numbers Q,, and the 
field of Laurent series K(¢)) with K any field. For concreteness, suppose 
that F = K(¢). Let K = G(K|[t]]) be the Chevalley group defined 
over the ring of power series with coefficients in K (the valuation ring 
of F). The evaluation map @ : K|[t]] > K, t t 0, induces a group 
homomorphism 0 : K > G(K). Let N = N(F) and B = B(K) be the 
groups from the previous paragraph (for the groups G(F) and G(K) 
respectively). The Iwahori subgroup of G(F) is the inverse image of B 
under @. That is, J = 6~!(B). Then (1, N, Wage, Sapp) is a Tits system 
in G(F). See [10, 27] for details. 

There is a vast generalisation of the notation of a Chevalley group. 
Recall that Chevalley groups are constructed as automorphism groups 
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of finite dimensional Lie algebras associated to Cartan matrices. In a 
similar (although highly non-trivial) way there is a construction of groups 
using infinite dimensional Lie algebras associated to generalised Cartan 
matrices (so called Kac—Moody algebras, see [28]). The associated Kac-— 
Moody groups admit Tits systems with more general Coxeter systems 
(see [53]). In fact every Coxeter system with ms: € {2,3,4,6, co} arises 
as the Coxeter group of a Tits system in a Kac—Moody group. 

We will now describe the connection between Tits systems and 
buildings. Let & be a building with system of apartments A. Suppose 
that G is a group acting on & by type preserving simplicial complex 
automorphisms, and that G preserves the apartment system A. We 
say that G acts strongly transitively relative to A if it is transitive on 
pairs (A,x) with A an apartment in A and x a chamber of A. For the 
statement of the following theorem it is convenient to adopt the approach 
to buildings from Remark 2.3. 


Theorem 2.5. (1) Let (B,N,W,S) be a Tits system in a group G. 
Let A= G/B, and defined: Ax A> W by 


6(gB,hB)=w_ if and only if g th € BuB. 


Then (A, 4) is a thick building of type (W,S'). The set A= {wB | we W} 
is an apartment, A = {gA | g € G} is a system of apartments, and G 
acts strongly transitively with respect to A. 

(2) Let (A,8) be a thick building of type (W,S) and suppose that 
a group G acts strongly transitively with respect to a G-invariant 
apartment system A. Let o € A be a chamber, and let A € A be an 
apartment containing o. Let 


B={gE€G|go=o0} and N={geG|greA for allzeé A}. 
Then (B,N,W,S) is a Tits system in G. 


Thus, we have a wealth of examples of thick buildings. In particular, 
given that every Coxeter system (W,S) with mz € {2,3,4,6,co} can 
occur as the Coxeter system of a Kac-Moody group, there are thick 
buildings of type (W,S) for every such (W,S). However, we should 
emphasise that not all buildings arise from this type of construction 
(see, for example, Theorem 2.14 below). 


2.3. Affine Buildings 


In this section we discuss some additional structural theory for affine 
buildings. Let © be an affine building of type (Wag, Sapp). The 
special vertices of the Coxeter complex of (Wag, Saf) are the elements 
of the coweight lattice P, and the special vertices of X are the vertices 
that are special vertices in some apartment. Let V be the set of all 
special vertices of ©. A sector in X is a subset s which is isomorphic to 
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Figure 14.5 A small piece of an Ap building 


a sector in some apartment of the building (where sectors in apartments 
are as in Section 1.3). Thus, sectors are always based at special vertices. 
A fundamental fact concerning sectors in affine buildings is (see [1, 
Chapter 11)): 


(S1) If x is a chamber of &, and if s is a sector of X, then there is a 
subsector 5’ of s such that 5’ U x is contained in an apartment. 


Figure 14.5 shows a simplified picture of an affine building of type Ag; 
however, note that if the building is thick then the ‘branching’ actually 
occurs along every wall, and so the picture is rather incomplete. All 
vertices in this building are special, and a sector is shaded. 

The following notion of ‘vector distance’ gives a refined way of 
measuring the distance between special vertices in affine buildings. 


Definition 2.6. Let x,y € V be special vertices of ©. The vector 
distance d(x, y) € P* from x to y is defined as follows. By (B2) there 
is an apartment A containing x and y, and let y : A > & bea type 
preserving isomorphism. Then we define 


d(z,y) =(vVy)-¥@)", 


where for w € P, we denote by w* the unique element in Wo P*. 
This value is independent of choice of apartment A and the isomorphism 
wv :A— X& (see [36, Proposition 5.6]). 


More intuitively, to compute d(x, y) one looks at the vector from x 
to y (in any apartment containing x and y) and takes the dominant 
representative of this vector under the Wo-action. 

If A is an apartment and s C A is a sector of A, then the retraction 
of & onto A with centre s is the map pa,; : & — A computed as 
follows: If « € X, choose (by (S1)) an apartment A’ containing x and a 
subsector of s. Let w : A’ > A be the isomorphism from (B3), and let 
PA,s(x) = W(x). It is easy to check that this value is independent of the 
apartment A’ chosen. Intuitively speaking, the retraction p,_, flattens 
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Figure 14.6 Vector Buesmann function for a tree 


the building onto the apartment A with the centre of this flattening being 
‘deep’ in the sector s (this is illustrated in Figure 14.6 for the rank 2 case 
of trees). 

If s is a sector in an apartment A, then there is a unique isomorphism 
WA : A —> X(Wart, Sar) mapping s to so and preserving vector 
distances (cf. [37, Lemma 3.2]). This gives a canonical way of fixing 
a Euclidean coordinate system on the apartment A with respect to the 
sector 5. 


Definition 2.7. The vector Busemann function associated to the 
sector $ is the function 


hs: X— P given by hs() = WaA,s(PA,5(2)), 


where A is any apartment containing s (this does not depend on A). We 
write h= hey. 


Example 2.8. Let © be a homogeneous tree of degree q+ 1. Thus, © 
is an affine building of type Ay (see Example 2.2). In this case the vector 
distance d(z,y) is simply the graph distance d(z,y), and the vector 
Busemann function hs : & — P is the familiar ‘horocycle function’ 
(perhaps with an additional superficial minus sign; see [3, Figure 14.1]). 
This is illustrated in Figure 14.6 — to compute h,(v) for a vertex v one 
reads off the ‘level’ of v. For example, hg(x) = —1. Note that there are 
infinitely many vertices on each horizontal level. More generally, the vec- 
tor Busemann functions for an affine building are vector analogues of the 
usual Busemann functions for a CAT (0) space (see [41, Proposition 2.8]). 


Example 2.9. Let R be an irreducible root system, let G = G(K((t))) 
be a Chevalley group over the field K(), and let K = G(K{[¢]]). 
Following the work of Iwahori and Matsumoto [27] and Bruhat and 
Tits [10], G/K is the set of type zero vertices of an affine building 
(thus is a subset of the set of special vertices). The Cartan and Iwasawa 
decompositions of G are (respectively): 
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G= ||] KhK and G=|| UK, (14.3) 
AE QNPt weQ 


(with U being the subgroup of G generated by the elements zq(f) with 
a € R* and f € K(), and with & given by & = GF) Pe (t~%) 
if A = ajo +---+ ago’). The vector distance between vertices gk and 
hK in & is 


d(gK,hK)=  ifandonlyif gg 'hK C KiK, (14.4) 


and since each u € U stabilises a subsector of the fundamental sector of 
x, the Busemann function h is given by 


h(gk) = uw if and only if gk C Ut K. (14.5) 


Thus, the vector distance d and the vector Busemann function h are 
natural statistics from the group theoretic context, encoding the Cartan 
and Iwasawa decompositions (respectively). 


2.4. Generalised Polygons 


Let © be a building of type (W, S) with chamber set A and Wey] distance 
6:AxA—> W. Let ICS, and let Wy; be the parabolic subgroup of W 
generated by J. The J-residue of a chamber x € A is 


Ry(a@) = {ty € A| d(@,y) € Wy}. 


This residue is a building of type (W7, J) (it is easiest to check this using 
the formulation of buildings from Remark 2.3). Thus, general buildings 
are ‘made up of’ many other buildings of lower rank. The rank 1 residues 
have no interesting structure, and so the important case is rank 2. 
Hence, buildings of type [2(m) play a critical role in the theory. We 
have already discussed buildings of type Jo(co) in Example 2.2, and so 
here we consider the building of type Jo(m) with 2 < m < ow. It is easy 
to verify that these buildings are equivalent to bipartite graphs with: 


diameter m, and girth 2m (where girth is the length of the shortest 
cycle). 


Such graphs are also known in the literature by another name: generalised 
m-gons. Thus, buildings of type Io(m) are equivalent to generalised 
m-gons. The chambers of the building are the edges of the generalised 
m-gon, and the apartments of the building are the cycles of length 2m 
in the generalised m-gon. Some examples are shown in Figure 14.7. 

It is not hard to see that finite thick generalised m-gons are necesarily 
biregular (with alternating valencies g +1 and r+ 1, say), and that if 
m is odd then necessarily gq = r. There is a lot to say about generalised 
m-gons (see, for example, [56]). Let us simply recall some of the key 
results. Since we are rarely interested in ‘ordinary’ m-gons, we will 
usually omit the adjective ‘generalised’. 
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(a) Building of type I,(3), p=q=2 (b) Building of type I,(4), p=q=2 


Figure 14.7 Examples of finite thick generalised m-gons 


Theorem 2.10. [50] For each m > 2 there exist thick m-gons. 


Theorem 2.10 shows that there are many examples of thick m-gons. 
The proof of the theorem is via a ‘free construction’ which produces 
m-gons in which every vertex has infinitely many neighbours. Finite thick 
m-gons are much more restricted, as shown by the following beautiful 
and unexpected theorem due to Feit and Higman. 


Theorem 2.11. [19] Finite thick generalsed m-gons exist if and only 
if m € {2, 3, 4, 6, 8}. 


The ‘only if’ part of the Feit-Higman theorem follows from rather 
involved character theory (see Appendix B). For the ‘if’ part of the 
theorem, the existence of finite thick 2-gons is clear (they are just 
complete bipartite graphs), and examples of finite thick 3-gons, 4-gons, 
6-gons and 8-gons come from the Chevalley groups A2(q), B2(q), Ge(q), 
and the twisted Chevalley group ?F4(2?”+!) via Theorem 2.5. 

Thus, in the finite theory the m-gons with m = 2,3,4,6,8 play a spe- 
cial role. We often call 4-gons, 6-gons and 8-gons quadrangles, hexagons 
and octagons respectively. Generalised 3-gons are called projective planes 
(this comes from the older language of incidence geometry). There are 
severe restrictions on the possible thickness parameters of projective 
planes, quadrangles, hexagons and octagons. Most of these restrictions 
have character theoretic proofs (we prove some of these in Appendix B). 
References to the original works can be found in Kantor [30]. 


Theorem 2.12. Let T be a finite thick m-gon with parameters (q, 1). 


(1) If m = 3 then q =r, and if q=1,2 mod 4 then q¢ is a sum of 
two squares. 
(2) Ifm=A4 then q<1?, r< q’, and P(qr+1)/(qt+7r) €Z. 
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(3) Ifm=6thengsr,r<@Q, P(Pretqrth)/(C+artr?) €Z, 
and ./qr € Z. 

(4) Ifm=8 theng<r?,r< @, A(qrt+l(2r?+)/(G4+r(P+r?) € 
Z, and ./2qr € Z. 


The number theoretic part of statement (1) is called the Bruck-Ryser- 
Chowla theorem. For example, it prohibits the existence of a projective 
plane with parameter q = 6. 

Together, Theorems 2.11 and 2.12 place a lot of conditions on the 
structure of general locally finite thick buildings. In particular, we 
immediately have the following corollary by looking at rank 2 residues. 


Corollary 2.13. Let X& be a locally finite thick building of type (W, S) 
with parameters (qs)scg. For each s,t € S with s # t let mst be the 
order of st. Then: 


(1) mse € {2,3,4,6,8, 00} for all s,t ES with s At, and 
(2) if mst € {3,4,6,8} then the pair (qs, q) satisfies the constraints 
from Theorem 2.12. 


It is an open problem to determine the possible parameters of thick 
projective planes, quadrangles, hexagons and octagons. The known 
examples have the following parameters (where we arrange the parame- 
ters (q,r) so that g < r). Projective planes: (q, g) with ¢ a prime power. 
Quadrangles: (q, q), (q, 97), (q?, ¢°), (q—1, ¢+1) with q a prime power. 
Hexagons: (q,q), (q, q°) with q a prime power. Octagons: (q, q2) with 
q = 2?*+1 an odd power of 2. Constructing a thick generalised m-gon 
with parameters other than these, or proving further restrictions on the 
possible parameters, would be revolutionary. Finally, we note that for 
a given value of the parameters there may be multiple non-isomorphic 
generalised m-gons. For example, there are four distinct projective planes 
with parameters (9, 9). 


2.5. Classification and Free Constructions 


Thick irreducible spherical (respectively affine) buildings of rank at least 
3 (respectively 4) have been classified by Tits (respectively, Tits and 
Weiss) (see [49] and [10, 52, 59]). Put very roughly, this classification 
says that all irreducible thick spherical buildings of rank at least 3 arise 
from groups of Lie origin via Tits systems, and that all irreducible thick 
affine buildings of rank at least 4 arise from groups of Lie origin defined 
over fields (or skew-fields) with discrete valuation via affine Tits systems. 
The precise statement of these classification theorems is involved (see the 
above references for details). 

On the other hand, the following essentially free construction shows 
that the situation is very different for Coxeter systems which contain no 
irreducible rank 3 spherical parabolic subgroups. 
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Theorem 2.14. [43] Let (W,S) be a Coxeter system such that every 
irreducible rank 3 parabolic subgroup of W is infinite. Suppose that 
(ds)seg tS a sequence of integers such that for each pair s,t € S there 
exists a generalised mst-gon with parameters (qs, qt). Then there exists a 
locally finite thick regular building of type (W, S:) whose rank 2 residues of 
type Wis,t} range through any desired set of generalised mst-gons having 
parameters (qs, 4t)- 


The above theorem tells us that rank 3 irreducible affine buildings 
(that is, those of type Ao, Bo, and Go) cannot be classified (at least not 
in the spirit of the higher rank classification; for example, one can make 
a Ag building with thickness parameter g = 9 whose rank 2 residues can 
be chosen freely from the four non-isomorphic projective planes with 
q = 9). The theorem also applies to all Fuchsian Coxeter systems, and 
so these buildings are also unclassifiable. In other words, many of these 
buildings are not related in any nice way to groups. Using Corollary 2.13 
and Theorem 2.14 we have the following existence result for Fuchsian 
buildings: 


Theorem 2.15. Let (W,S) be a Fuchsian Coxeter system of type 
F(ky,...,kn). There exists a locally finite thick building of type (W, S) 
if and only if ky € {2,3,4,6,8} for alli =1,...,n and either kj € {2,4} 


for some i=1,...,n or |{z | kj = 8}| ts even. 


Proof. Suppose that there is a locally finite thick building © of type 
F(k,..., kn). By Corollary 2.13 we have k; € {2,3, 4,6, 8} (since k; 4 co 
by definition for Fuchsian systems). Let (q1,..., Qn) be the parameters 
of © (arranged cyclically so that the parameters of the rank 2 residue 
corresponding to kj are qj and q+1). If kj # 2,4 for all ¢ then kj € 
{3,6,8} for all 7. If kj € {3,6} then /a@qisi € Z, and if kj = 8 then 
/2¢ G41 € Z (see Theorem 2.12). Multiplying these conditions together 
gives 2°/2q1 --- qn € Z where a = |{i | kj = 8}|, and hence a is even. This 
proves the ‘only if’ part of the theorem. We leave the ‘if’ part of the 
theorem as an exercise (using Theorem 2.14 and the known examples of 
generalised m-gons listed after Corollary 2.13). 


3. Random Walks on Buildings 


A random walk on a finite or countable space X is a sequence (Xn)n>0 
of X-valued random variables governed by a stochastic transition matrix 
(or transition operator) P = (p(x, y))x,yex. That is, 


p(2,y) = P[Xn41 = y|Xn=2| for all z,y € X and all n= 0, 
and the transition operator P acts on €!(X) by 


Pf(z)= \> p(w, f(y) for all x € X. 


yex 
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The n-step transition probabilities of the walk are 


and we have P” = QC I) eyex: The random walk (Xn)n>0 is 
irreducible if for each pair x,y € X there exists n > O such that 
p™ (x,y) > 0. For the general theory of random walks we refer to [60]. 

Here we are interested in random walks on buildings © (and associated 
groups). There are a few variations; for example, one might consider 
random walks on the set A of chambers of X&, or one might consider 
random walks on the vertices of ©. This latter case is particularly natural 
for affine buildings. To begin with we will consider random walks on the 
chambers of a general (locally finite) building, mainly following the setup 
from [21, 36]. 


3.1. Random Walks on Chambers and the Hecke Algebra 


Let = be a locally finite building of type (W, S) with chamber set A and 
parameters (qs)seg. Let (Xn)n>0 be a random walk on the chamber set. 
Without some additional assumptions on the walk there is not so much 
that one can say. A natural assumption is to assume that the walk is 
isotropic, meaning that the transition probabilities depend only on the 
Weyl] distance: 


Definition 3.1. A random walk (Xn)ns0 on A is isotropic if the 
transition probabilities of the walk satisfy p(x, y) = p(2’, y’) whenever 
d(x, y) = d(a', y’). 


Isotropic random walks have a beautiful algebraic structure. For each 
we W let Py = (pw(2,y))2,yea be the transition operator of the 
isotropic random walk with 


q, if d(2,y) =w 


Zz, = . 
P(t, y) 0 otherwise. 


The following elementary proposition shows that every isotropic random 
walk on A is a convex combination of the random walks P,,, w € W. 


Proposition 3.2. A random walk on A with transition operator P is 
isotropic if and only if 


P= > QyPy where ay > 0 and ss dy = 1, 
wew weWw 


in which case p(x, y) = Gude: if O(@, y) = w. 


Therefore we are naturally led to consider linear combinations of the 
(linearly independent) operators Py, w € W. Let # be the vector space 
over C with basis {Py | w € W}. The key facts about Y are summarised 
below (see [36, § 3]). 
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Theorem 3.3. [36] The vector space F is an associative unital algebra 
under composition of linear operators, and the multiplication table with 
respect to the vector space basis {Py | w € W} is given by 


PuPv = >> cllyPw where cl, =“ |Au(a) NA, «(y)| 
, : qu du 
weWw 
for any t,y € A with d(4,y) = w. In particular, the intersection 


cardinalities |Ay(2) A A,-1(y)| depend only on u,v and d(x, y), and 
forwé W ands éS we have 


Pre if &(ws) = €(w) +1 


PiyPe = 
Vas Pus +A— ag) Pw if (ws) = &(w) - 1. 


(14.6) 


The algebra # is called the Hecke algebra of the building. If 5(z, y) = 
w then the n-step transition probability p(x, y) is given by 


pM =7) 0s where Pha Ye a Pa: 
weW 


Thus, finding p(x, y) when 6(x,y) = w is equivalent to finding the 
coefficient of Py in P”. 

Before surveying known results on isotropic random walks, let us 
briefly indicate how isotropic random walks arise from bi-invariant 
measures on groups acting on buildings (for example, groups of Lie 
type, or Kac-Moody groups). Specifically we have the following (see 
(14, Lemma 8.1] for a proof in a similar context). 


Proposition 3.4. Let G be a locally compact group acting transitively 
on a regular building &, and let B be the stabiliser of a fixed base 
chamber o. Normalise the Haar measure on G so that B has measure 1. 
Let g be the density function of a bi-B-invariant probability measure 
on G. If the group B acts transitively on each set Aw(o) with we W, 
then the assignment 


p(go, ho) = g(g-'h) 


for g,he G defines an isotropic random walk on the chambers of X. 


3.2. Isotropic Random Walks on Spherical Buildings 


Let & be a locally finite thick spherical building of type (W,S) and let 
(Xn)n>0 be an isotropic random walk on the set A of chambers with 
transition operator P = (p(z, y))z,yea- Because the set A of chambers 
is finite, natural questions to ask include: 


(1) What is the limiting distribution of the walk? 
(2) What is the value of p\™ (x, y)? 
(3) What is the mixing time for the walk? 
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The first question is very easy to answer: 


Proposition 3.5. Let (Xn)ns0 be an irreducible isotropic random walk 
on the set A of chambers of a locally finite thick spherical building. Then 
the uniform distribution is the unique invariant measure, and 


lim p(x) = ao for allxe A. 
n> oo |A| 

Proof. Using the thickness of the building it is not difficult to see 

that irreducible isotropic random walks on A are necessarily aperiodic 

(see [21, Lemma 4.3]). Thus, an irreducible isotropic walk has a unique 

stationary measure. To see that this stationary measure is the uniform 

measure u: A — [0,1], note that for each y € A, 


Y= u2)p(@, y) = a ye PS) 


ceA weW reAw(y) 
1 q 1 1 
Al 2s, ag (ae a 
wew fw weW 
1 


where we have used the fact that if 6(z, y) = w then 6(y,z) = w ~, and 
that qy-1 = qw- 


To address questions (2) and (3) we can apply the techniques from [18], 
where the representation theory of Hecke algebras is used to analyse 
convergence of systematic scan Metropolis algorithms. The aims and 
context of [18] are quite different from our setting here, and so we will 
give an overview of the basic setup of [18], translating into our building 
theoretic point of view. 

We first recall some basic representation theory of finite dimensional 
associative unital algebras over C. This theory generalises the more 
familiar representation theory of finite groups (see, for example, [20]). 


Definition 3.6. Let @ be a finite dimensional unital associative 
algebra over C. A representation of & is a pair (0, V) where V is a 
C-vector space, and p : & — End(V) is an algebra homomorphism. 
The character of the representation (p, V) is the function x, : # > C 
given by 

Xp(A) = tr(p(A)) for all Ac &. 


The dimension of (p, V) is dim(V) (assumed to be finite throughout this 
section). Sometimes it is convenient to simply denote a representation 
(p, V) by p, and to write V = Vy. 


Since End(V) = Mg(C) (the algebra of d x d matrices with entries 
in C, where d = dim(V)), arepresentation of Y amounts to ‘representing 
the algebra elements by matrices’. Familiar notions of direct sum, 
subrepresentations, and irreducibility carry over from the group setting. 
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An algebra is semisimple if every finite dimensional representation 
decomposes as a direct sum of irreducible representations. In this case 
the irreducible representations are the ‘atomic building blocks’ of the 
representation theory in the sense that every representation can be 
written as a direct sum of these atoms. Note that the group algebra 
of a finite group is necessarily semisimple (this is Maschke’s theorem), 
however, general algebras need not be. 


3.2.1. Hecke Algebras and the Geometric Representation 

Let (W,S) be a spherical Coxeter system, and let (q5)s<g be a sequence 
of numbers with gq, > 0 and qs = q@ if s and t are conjugate in W. Let 
H be the algebra over C generated by symbols Ty (with w € W) with 
relations 


Tws if £(ws) > L(w) 


Tio = 
ane Gs Twst+d—a5)Tw if €(ws) < ew). 


(14.7) 
The algebra # is unital (with identity T.) and associative, and is called 
an abstract Hecke algebra (see [26]). 

If (ds)seg are the parameters of a locally finite spherical building of 
type (W,S) then, comparing (14.6) and (14.7) we see that the Hecke 
algebra Y of X gives a representation of the abstract Hecke algebra 
H. More specifically, let Va = @z<a Cx be a vector space with basis 
indexed by the the chambers of the building, and let pa : # — End(Va) 
be the linear map with pa(Ty) = Py for all we W. Then 


(pa, Va) is a |A|-dimensional representation of #7. 


We call this representation the geometric representation of the Hecke 
algebra #. In fact the map pa: # > # is bijective, and so Y= #. 
We emphasise that the geometric representation of the abstract Hecke 
algebra # only exists when there is a building with parameters (qs) s<¢9. 

It is well known that the algebra # is semisimple (see [24]). 
Thus, the geometric representation decomposes into a direct sum of 
irreducible representations. Writing Irrep(#) for the set of irreducible 
representations of # (more formally, isomorphism classes of irreducible 
representations) we have 


em 
Yue GD Ye” 
pelrrep(#) 


where for each p € Irrep(.#@) the integer m, > 0 is the multiplicity of 
(p, Vp) in (pa, Va). Thus, the character xq of the geometric represen- 
tation is given by 


XA = Ss ™pXp- (14.8) 
pelrrep(#) 


The first fundamental task is to compute the multiplicities mp. 
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Theorem 3.7. The multiplicity mp of the irreducible representation o 
in (pa, Va) ts 


aim) 
a (Xo: Xp) 


1 
, where (f,9) = 70 D> wf (Tw)9(Lu-2) 
wew 


for functions f,g: # > C. 


Proof. It follows from general results on the representation theory of 
symmetric algebras (see [20, Corollary 7.2.4 and §8.1.8]) that if x and 
x’ are irreducible characters of # then 


(x,x’) =0 if and only if x and x’ are non-isomorphic. (14.9) 


Thus, taking inner products with x, in (14.8), and using the facts that 
Xa(Tw) = tr(Pw) = bw,elA| and xp)(Te) = dim(p), the result follows. 


3.2.2. The Transition Probabilities p\ (a,y) 

We now return to question (2), seeking a formula for the n-step return 
probabilities p™ (2, y). In principal one could compute this probability 
by noting that it is the (2, y)*® entry of the matrix P”. Of course, this 
is not a practical method because P is a very large matrix (for example, 
for the smallest thick F4 building the matrix P has approximately 2 x 
10° rows and columns!). The following theorem gives a more practical 
solution to the problem. 


Theorem 3.8. Let P = \\awPy € F be the transition matrix of 
an isotropic random walk on a regular spherical building, and let T = 


Ye dw Ty € #. Then 
1 
p™ (a, y) = 7 Yo mpxp(T" Tyr) if 8(2,y) = w. 
pelrrep(.#) 
Proof. We claim that 
Xa(TuTy-1) = G1Albu,o for all u,v € W. (14.10) 


To see this, note that xya(Tw) = tr(pa(Tw)) = tr(Pw) = |Aldw,e 
(because each Py is a |A| x |A| matrix, with (c, y)th entry equal to 
1 if 5(z, y) = w and 0 otherwise). Thus, 


Xa(TuTy-1) = tr(PuPy)= D> cl’ -str(Pw) 
wew 


1 
_ C. palAl = ——|Axy(0)N Ay(o)|, 
: qu Qu 


where we have used Theorem 3.3, and (14.10) follows. 
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Now, if P? = al P,, then T” = af” Ty, and so 


a yn(Ty Ty-1) = q,,' a. 


vEeW 


1 
HACE? Ppt) = 
|A| - |A 


If 6(z,y) = w then p™(z,y) = q,ta®, and thus p\™(z,y) = 


xa(T” T,,-1)/|A|. The result follows from (14.8). 


3.2.3. Mixing Times 

Now we move to the more sophisticated question of mixing times. 
To begin with we need to define a notion of ‘distance’ between two 
measures. In the literature the total variation distance has become a 
standard choice, popularised by Persi Diaconis (see, for example, [17]). 
This distance is defined as follows: If 4 and v are probability measures 
on A then 


| — vilty = max |“(A) — v(A)]. 
ACA 


To work with the total variation distance it is helpful to observe the 
following elementary fact. 


Lemma 3.9. Let w be probability measure on A and let u: A > [0,1] 
be the uniform distribution. Then 


1 VIA 
le = ulltv = glu —ullis< 5 le — ull2 
where || - ||, and || - lz are the £! and €? norms respectively. 


Proof. Clearly || — ullty equals either |j2(A) — u(A)| or |u(B) — u(B)I, 
where A and B are the sets A = {7 € A | u(x) > u(x)} and B= {re 
A | u(r) < u(x)}. In fact 


|u(A) — u(A)| = [1 — w(B) — 1+ u(B)| = | (B) — u(B) 


and so || — ulliv = 3 ((u(A) — u(A)| + [u(B) — u(B)|) = $llu — alli. 
The final inequality follows from Cauchy—Schwarz. 


Let uw : A — [0,1] be the measure w™ (x) = p™ (0, x), where 0 € A 
is a fixed chamber of &. The following theorem gives a mathematically 
tractable upper bound estimate for the total variation distance || — 
ullty, and thus can be used to give upper bounds for mixing times. Define 
an involution *: #4 > # by (> aw Tw)* = Yo GwT,-1. Note that # 
has a one-dimensional representation privy (the trivial representation) 
given by ptriv(Ts) = 1 for all s € S (to check this, simply verify that the 
defining relations (14.7) are satisfied). 
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Theorem 3.10. (cf. [18]) Let P= ¥° dw Py be the transition operator 
of an isotropic random walk on the chambers of a regular spherical 


building, and let T = Y° dwTy € #. Then 


1 
We — ally $7 DL Moxe (T™(T"). 
PF Ptriv 


Proof. We have 


(n) 


lA lA 
= lle sha = wll} = = (uO = vw — a) 


|| He 1 


Now 


S eA, oe 
wu, wr = Sag) (a) = GxalTrry”, 


and so 


ik 
Wa — ul, = (xa T"(P)") — 1). 


Theorems 3.8 and 3.10, together with the multiplicity formula from 
Theorem 3.7, provide some basic theory for studying isotropic random 
walks on spherical buildings. To make more practical estimates in given 
examples one needs to work harder with the representation theory 
(for example, to give meaningful bounds on the right-hand side of the 
inequality in Theorem 3.10). Some calculations are made in [18], in a 
different context, that can be translated to give estimates for certain 
random walks on buildings © of type Ay (see [18, Proposition 7.4 and 
Theorem 7.5]). However, in the building theoretic context the walks 
covered in [18] are perhaps not the most natural (for example, the 
simple random walk is not covered). Thus, there is still a lot to do 
in this direction, and we hope that the setup provided above might 
stimulate some future research. In Appendix B we outline the details of 
the representation theory in the rank 2 case (that is, when the building 
is a generalised polygon). 


3.3. Random Walks on Affine Buildings 


Let (Xn)n>0 be a random walk on an infinite graph with transition 
probabilities p(x, y). Natural questions to ask in this setting include: 


(1) At what velocity does the random walk move to infinity? 

(2) What is the distribution of the fluctuations away from expected 
distance? 

(3) What are the asymptotics of p(™ (2, y)? 
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Appropriate solutions to these problems come in the form of a law 
of large numbers, a central limit theorem, and a local limit theorem 
(respectively). 

For random walks on affine buildings it is natural to consider both 
random walks on the chambers of the building, and random walks on 
the vertices of the building. The latter case now has a rather complete 
theory. We will consider both cases below. 

If the affine building has rank 2 then we are dealing with a random 
walk on a trees. In this context there is a huge literature which takes us 
too far afield to discuss here, and so we will focus on the higher rank case. 


3.3.1. Random Walks on the Vertices of an Affine Building 
Let R be an irreducible root system with coweight lattice P, and let 
(Wagt, Saft) be the associated affine Coxeter system. Let © be a regular 
affine building of type (Wag, Sag), and let V be the set of all special 
vertices of &. Recall the definitions of the vector distance function d(., -) 
from Definition 2.6. Some of the formulae of this section become more 
complicated in the case of Cx buildings with go # gn, and so here we 
will restrict to the case gg = dn for Cn buildings (see [36-38] for the 
general case). 

We now define isotropic random walks on the set V of all special 
vertices, and outline the algebraic and analytic theory that is used to 
analyse them. In Appendix A we will give more details in the specific 
case of C's buildings, where one can carry out the calculations ‘by hand’. 


Definition 3.11. A random walk (Xn)n>o0 on V is isotropic if its 
transition probabilities satisfy 


p(x, y) = p(a’,y') whenever d(a, y) = d(a’, y’'). 
For each « € V and A € Pt let 
Vi(2) = {y € V | d(x, y) =A} be the sphere of ‘radius’ 4 centred at z. 


The cardinality Nj, = |V;(a)| does not depend on « € V (see [87, 
Proposition 1.5]). If (Xn)nso0 is an isotropic random walk on V then 
there are numbers a, > 0 with }0,-p+ a@ = 1 such that 
p(x, y) = = for all y € Vi(z). (14.11) 
2 


In an analogous way to the case of isotropic random walks on chambers 
(see Proposition 3.2) the transition operator A of an isotropic random 
walk on the vertices of a regular affine building is of the form 


A= > mA, (14.12) 
AePt 
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where the numbers a, are as in (14.11) and the operator A, acts on 
functions f : V > C by 


1 
A,f (2) = a Pato 


Let & be the vector space over C with basis {A, | A € Pt}. The 
following is an analogue of Theorem 3.3 (the proof is, however, a little 
more involved). 


Theorem 3.12. [37, Theorem 5.24] The vector space & is a commu- 
tative associative unital algebra under composition of linear operators. 


The algebra & plays an important role in understanding isotropic 
random walks on the vertices of affine buildings. The key feature of 
Theorem 3.12 is that this algebra is commutative. In fact one can be 
more precise. 


Theorem 3.13. [37, Theorem 6.16] Let & be a regular affine building. 
Let P the the algebra of chamber set averaging operators on X (c.f. 
Theorem 8.3) and let & be the algebra of vertex set averaging operators 
on X (c.f. Theorem 8.12). Then & is isomorphic to the centre of F. 


For each a € R we write q = qi if a € Woa;, and let 


di 
r= I] Ie Os) for all A € P. 


aeRt 


If q = q for all i = 0,1,...,d then r* = q//? where t is the 
translation by 2. 

If u € Hom(P,C*) we write u* = u(A), and if w € Wo and u € 
Hom(P,C™*) let wu € Hom(P,C*) be given by (wu)* = u™ for all 
dX € P. For each 4 € Pt the Macdonald spherical function P, is the 
function P, : Hom(P,C*) > C given by 


— _ ,-1,,-aY 
AW = Woo > u”’*e(wu) where c(u) = I] aaa 


we Wo aeRt 


Vv ’ 


1-—u% 


where Wo(q7!) = Poe Wada 1 This formula requires, of course, that the 
denominators are nonzero, however it turns out that P,(uw) is a linear 
combination of terms wu with uw € P and so the ‘singular’ cases where 
a denominator vanishes can be obtained by taking an appropriate limit 
in the general formula. The Macdonald spherical functions arise in the 
representation theory of p-adic groups (see [34]). 

Theorem 3.13, combined with the Satake isomorphism, implies the 
following result, giving a complete description of the irreducible repre- 
sentations of &. 
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Theorem 3.14. [87, Proposition 2.1] For each u € Hom(P,C%) there 
is a one-dimensional representation ty of L given by my(A,) = Py(w). 
Moreover, every one-dimensional representation x of & is of the form 
T= Ty, for some u € Hom(P,C%), anday =a2y if and only if u' = wu 
for some w € Wo. 


Each A € o& maps €2(V) into itself, and ||A,fll2 < ||fll2. Thus, we 
may regard .o/ as a subalgebra of the C*-algebra Y(€2(V)) of bounded 
linear operators on €2(V). It is not hard to see that Aj = Aj» where 
A* = —wod (with wo the longest element of Wo), and thus & is closed 
under taking adjoints. Let be the completion of & with respect to 
the €?-operator norm || - ||. Thus, 2% is a commutative C*-algebra. By 
passing to this completion we ensure that the transition operator A of 
an isotropic random walk on V is an element of ~ (it is an element of 
the ‘uncompleted’ algebra « if and only if the walk has bounded range). 

The one-dimensional representations of .~% are precisely the extensions 
to & of the representations 7, : & — C which are continuous with 
respect to the €?-operator norm, and in [37, §5] it is shown that these are 
the representations 7, with u ¢ Hom(P,T) where T = {z € C | |z| = 1}. 
If wu ¢ Hom(P,T) and A € & we write A(u) = my(A) (the Gelfand 
transform of A). In particular, we have A;(u) = P,(u), and if A is the 
transition operator of an isotropic random walk as in (14.12) we have 


A(u) = Y~ a,P,(u) for all u € Hom(P,T). 
Ae Pt 


The final ingredient in the analysis of <&% is the calculation of the 
Plancherel measure. 


Theorem 3.15. [34, Theorem 5.1.5], [37, Theorem 5.2] Let du denote 
normalised Haar measure on U = Hom(P,T), and let uw be the measure 
on U given by 

Woq"') 1 


mH) = Tia Tee? 


Then 
1 ~ SS 
af Aj (u)Ay/(u) du(u) = 64" for all A, € PT. 
Ni Ju 


Theorem 3.15 implies that the n-step transition probabilities of an 
isotropic random walk with transition operator A are given by 


1 ~ oe 
p(n, 9) = / A(u)"AxWdp(u) if d(e,y)=a. (14.13) 
Ny Ju 
This is the analogue of Theorem 3.8, and is a key result in studying 


isotropic random walks on the vertices of affine buildings. Indeed the 
primary limit theorems (that is, the law of large numbers, the central 
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limit theorem, and the local limit theorem) can all be proven using 
the above machinery via techniques from classical harmonic analysis. 
These limit theorems were proved by Lindlebauer and Voit [33] for the 
case of A» buildings, and Cartwright and Woess [14] for the case of Ap 
buildings. The general case was settled by Parkinson [38], and the results 
are summarised below. 


Theorem 3.16. [38] Let (Xn)n>0 be an isotropic random walk on the 
vertices of a locally finite thick regular affine building. 


(1) Under the moment assumption >, <p+ |Ala, < 00 there exists y € 
E* such that 
d(o, Xn) 


lim ————— = y__ almost surely. 
n—0o n 


(2) Under the moment assumption >°, < p+ |A|? a, < 00, the vector 
(d(o, Xn) — ny)/Vn 


converges in distribution to the multivariable normal distribution 
N(0,T), where T is a positive definite matric. 

(8) Let y € Vi(2) andn EN. Suppose that (Xn)n>o is irreducible and 
aperiodic. Then 


p(x, y) = CP, AG" nM? (14 O(n), 


where C' > 0 is an explicit constant. 


Remark 3.17. We remark that the precision of Theorem 3.16 is really 
quite impressive, with explicit formulae for the speed, variance, radius 
of convergence, and all asymptotic constants (see [38] for details). In 
the local limit theorem the assumption of aperiodicity may be removed, 
see [38]. 


Heat kernel and Green function estimates for finite range isotropic 
random walks on affine buildings have been_ obtained recently by 
Trojan [55] (with earlier results obtained for A, buildings by Anker, 
Schapira and Trojan [2]). The starting point for this analysis is again 
formula (14.13). Estimates for the Green function are given within the 
radius of convergence, and at the radius of convergence. For example, at 
the radius of convergence Trojan proves: 


Theorem 3.18. [55, Theorem 7] The green function of a finite range 
isotropic random walk on the special vertices of an affine building of 
rank r, evaluated at the radius of convergence, satisfies 


[o-e) 

= mary tae + 
Y PMane SPO 
n=0 
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Finally, convergence results for isotropic random walks on affine 
buildings to Brownian motion in a Weyl sector have been studied by 
Schapira, at least in the context of nearest neighbour random walks on 
A,r buildings. We now describe this result. Let (Xn)n>0 be a symmetric 
nearest neighbour random walk on the vertices of an Ay. building with 
transition probabilities p(x, y). Let be the spectral radius of (Xn)n>0 
and let (Yn)n>o0 be the random walk with transition probabilities 


Pao yy) al 
Q(z, y) = p(z, y) =—— p 
Pao,z)Q) 


(it is easily seen that this defines a random walk on the building using 
(37, Theorem 3.22]). Schapira proves the following (see [46, § 2] for the 
relevant definitions of Brownian motion): 


Theorem 3.19. [46, Theorem 6.1] With the notation as above, the 
sequence (Z/")¢>0 with 
i 1 
Zp = wm Yin) 


converges in law to Brownian motion Ut)t>0 in the sector $9 as n > oo. 


3.3.2. Random Walks on the Chambers of an Affine Building 
The literature on isotropic random walks on the chambers of affine 
buildings is currently less complete. A key reason for this is that 
the algebra Y of averaging operators on the chambers of an affine 
building is noncommutative. Thus, the Plancherel theorem for this 
infinite dimensional noncommutative algebra is rather sophisticated (see 
[35] and [40]). The general approach to the primary limit theorems is 
outlined by Parkinson and Schapira in [39], and the detailed calculations 
are carried through for Ag buildings. The general case is in preparation 
by the author. 


Theorem 3.20. [39, Theorem 3.7] For the simple random walk on the 
chambers of a thick Ag building with thickness q > 1 we have 


p™ (x,y) = Cap'n (1 + o(n-¥?)) if S(2,y) = w 


where Cy is an explicitly computable constant (depending on w and 
q only), and where the spectral radius p is given by p = B(¢-—D)+ 


vq? +34q4+ 1)/6q. 


Remark 3.21. Assuming a suitably transitive group action, a formula 
for the spectral radius for an isotropic random walk on the chambers 
of an affine building can be deduced from results of Saloff-Coste and 
Woess [44]. In particular, see [44, Example 6]. 
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3.3.3. Regular Sequences in Affine Buildings 

Recently Parkinson and Woess [41] proved the ‘p-adic analogue’ of 
Kaimanovich’s characterisation [29] of regular sequences in symmetric 
spaces. This theory has applications to random walks on buildings and 
associated groups, and we describe this here. Recall the definition of the 
vector Busemann functions hg from Definition 2.7. 

It is convenient to work with a natural ‘metric realisation’ of the 
affine building &. By the construction in Section 1.3 we may regard the 
apartments of an affine building as tessellations of a Euclidean space, 
and thus there is a metric on each apartment. Using axioms (B2) and 
(B3) it can be shown that these metrics may be ‘glued together’ to make 
x into a metric space (see [1, §11.2]). By [1, Theorem 11.16] this metric 
space is a CAT(0) space. In this section, we will regard affine buildings 
as metric spaces, although we also remember the underlying simplicial 
complex structure. The vector distance and the Busemann functions 
(originally only defined for vertices) naturally extend to give a vector 
distance and Busemann function for any points z, y of the building (see 
[41] for details). 

Let A € Et. A A-ray in © is a function t: [0,00) > © such that 


d(v(t1), t(#2)) = (2 —t1)A for ally >t > 0. 
Because we are specifying both a speed and direction, the notion of a 
A-ray is a refinement of the usual notion of a ray in a CAT(0) space. 


Theorem 3.22. [41, Theorem 3.2] Let (%p)n>0 be a sequence in X, 
and let} € E*. Lets be a sector of &. The following are equivalent: 


(1) There is a A-ray t: [0,00) > & such that d(an,t(n)) = o(n). 

(2) d(an, In41) = 0(n) and hg (fn) = Nis + 0(n) for some ws € Woda 
(independent of n). 

(3) d(tn,%n41) = o(n) and d(o, Zp) = na + o(n). 


A sequence (%p)n>o0 satisfying any one of the above equivalent 
conditions is called a A-regular sequence. This is a direct analogue of 
Kaimanovich’s results on symmetric spaces [29], and a generalisation of 
results of Cartwright, Kaimanovich and Woess on homogeneous trees 
[13]. We will discuss applications of Theorem 3.22 to random walks on 
affine buildings and associated groups in this section and the next. 

Since © is CAT(O) we define the visibility boundary 0X in the usual 
way as the set of equivalence classes of rays (with two rays being 
equivalent if the distance between them is bounded). The standard 
topology makes © = UA into a compact Hausdorff space (see Bridson 
and Haefliger [6, §I1.8.5]). Points of the visibility boundary are called 
ideal points of &. Given € € 0X and « € &, there is a unique ray in the 
class € with base point x ([6, Proposition II.8.2] or [1, Lemma 11.72]). 
We sometimes denote this ray by [x,&). Thus, one may think of dD as 
‘all rays based at x’ for any fixed x € &. 
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Definition 3.23. A random walk on V is semi-isotropic if the 
transition probabilities of the walk depend only on the vectors d(z, y) 
and h(y) — A(z). 


Clearly isotropic random walks are semi-isotropic, but not vice-versa. 
For each A € P let Hy = {x € V | h(a) = A}. As shown in (41, 
Proposition 4.6] semi-isotropic random walks are ‘factorisable’ over P, 
in the sense that the value of the sum 


DA, L) = a p(z,y) withaA,we Pandze A, 
yeH, 


does not depend on the particular  € H, chosen. Moreover, we have 
pAtv,u+tv)=p,u) for alla, u,v e P. 


In other words, if (Xn)n>0 is semi-isotropic then the sequence h(X;,) € P 
is a translation invariant random walk on P with transition probabilities 
pA, “). Since P = Z¢ the random walk (A(Xn))n=0 is well understood 
from the classical theory, and using Theorem 3.22 we obtain the following 
result for the original random walk (Xn)n>o0 on the building. 


Theorem 3.24. [41, Corollary 4.8] Let (Xn)nso be a semi-isotropic 
random walk on V. Under the finite first moment assumption >) <p 
pO, v)|v| < co we have 


1 
lim —d(o,Xn)=A_ almost surely, 
no n 
where A is the dominant element in the Wo-orbit of u = >>, <p PO, v)v. 
Moreover, if X 4 0 then (Xn)n>0 converges almost surely to an ideal 
point Xoo. 


The drift-free case (when A = 0) is more subtle. A weaker form 
of convergence of the random walk in this case is established in [41, 
Theorem 4.15] for nearest neighbour random walks. 


3.3.4. Random Walks on Groups Acting on Affine Buildings 
Limit theorems for isotropic random walks on the vertices of affine 
buildings imply limit theorems for bi-K-invariant probability measures 
on groups acting sufficiently transitively on the building, where K is the 
stabiliser of a fixed (special) vertex of the building. This is completely 
analogous to the chamber case of Proposition 3.4 (see [38, Remark 2.19] 
for some details). For example, If G = G(Q,) is a Chevalley group 
over the p-adic numbers, and if K = G(Zp) with Z, the ring of p-adic 
integers, then Theorem 3.16 gives a local limit theorem for the density 
function of a bi-K-invariant probability measure on G (see Example 2.9). 
In the prototypical example of G = SLq+1(Qp) one can remove the 
bi-K-invariance assumption, at the cost of losing explicit formulae for 
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the spectral radius. We expect the following result of Tolli to hold for 
more general Lie types, however at present it is only available for SDg+1. 


Theorem 3.25. [54] Let G = SLq4,(F) where F is a local field. Let f 
be a continuous compactly supported density of a probability measure on 
G such that 


(1) f is symmetric, that is f(x) = f(x—!), and 
(2) the support of f is a neighbourhood of the identity that generates G. 


Then there exists a number p > 0 and a positive function w : G— Rso 
such that 


pn t4t+2)/2 F) _, y pointwise as n > oo. 


Let © be a regular affine building, and let G be a subgroup of the 
automorphism group Aut(~). Let o be a Borel probability measure on 
G, such that the support of o generates G. We say that o has finite first 
moment if 


/ d(o, go) da(g) < o~. 
G 


Let (gn)n>o be a stationary sequence of G-valued random variables with 
joint distribution o The right random walk is the sequence (Xn) n>0 with 


Xp=o and X,=91-:-gno forn>1. 


The theory of regular sequences (Theorem 3.22) implies the following 
result for the right random walk. 


Theorem 3.26. (41, Theorem 4.1] Let G and o be as above, and 
suppose that o has finite first moment. Let (Xn)n>0 be the associated 
right random walk on X%. There exists 1 € Et such that 


1 
lim —d(o,X,)=A almost surely, 
non 


and for each sector s of X there exists Us € Wodr such that 
1 
lim —hg(Xn) =u almost surely. 
nwo n 
IfX #0 then (Xn)nz0 converges almost surely to an ideal point Xoo. 


3.4. Random Walks on Fuchsian Buildings 


Probability theory for buildings and related groups of non-spherical, 
non-affine type is in its infancy. Recently isotropic random walks on 
the chambers of Fuchsian buildings have been studied by Gilch, Miller 
and Parkinson, and a law of large numbers and a central limit theorem 
have been obtained. In this case the Hecke algebra has less controllable 
representation theory than in the spherical and affine cases, owing partly 
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to the existence of free group subgroups in the Coxeter systems. Thus, 
the representation theoretic techniques that have worked so nicely for 
the spherical and affine cases do not seem to help. 

Instead the arguments rely much more heavily on the underlying 
hyperbolic geometry of the building and the planarity of its apartments, 
with the general ideas adapted from the work of Haissinski, Mathieu and 
Miiller [25]. In this work the planarity and hyperbolicity of the Cayley 
graph of a surface group are exploited to develop a ‘renewal theory’ 
related to the automata structure of the group. In the setting of Fuchsian 
buildings the apartments of the building are planar and hyperbolic, and 
so similar ideas can be applied to the apartments. To lift this to the 
entire building requires some more work, and in [21] a theory of cones, 
cone types and automata for Fuchsian buildings paralleling the more 
familiar notions in groups is developed to achieve this goal. The idea is 
to find a decomposition of the trajectory of the walk into aligned pieces in 
such a way that these pieces are independent and identically distributed. 
Roughly speaking, one fixes a recurrent cone type T and sets R; to be 
the first time that the walk visits a cone of type T and never leaves this 
cone again. Inductively one defines Rn+1 to be the first time after Ry, 
that the walk enters a cone of type T and never leaves it again. The main 
results of [21] are as follows. 


Theorem 3.27. Let X& be a regular Fuchsian building and let (Xn)n>0 
be an isotropic random walk on A with bounded range. Then, 

1 a.s. i[d(XR XR ) 

—d(o, X = i" >0 > oo. 

- (0, Xn) — v [Ro — Fi] asn 
Theorem 3.28. Let X be a regular Fuchsian building and let (Xn)n>0 
be an isotropic random walk on A with bounded range. Then, with v as 
in Theorem 3.27, 


d(o,Xn)-—nvu D 5 2 El(d(Xp,,Xp,) — (Ro — Riv)? 
ir > N(0,0°), where o*= E[Ry — Ry] : 


3.5. Future Directions 


We conclude the main body of this chapter by listing some future 
directions and open problems in the theory of random walks on buildings: 


1) Provide sharp mixing time estimates and establish cut-off phe- 
nomenon for natural random walks on spherical buildings (in 
particular, for the simple random walk). 

2) Prove a law of large numbers, a central limit theorem, and a local 
limit theorem for isotropic random walks on the chambers of affine 
buildings (generalising [39]). 

3) Establish a local limit theorem for p-adic Lie groups of general 
type (generalising [54]). 
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(4) Prove convergence properties for the right random walk on a group 
acting on an affine building in the drift free case (c.f. [41]). 

(5) Give an explicit formula for the spectral radius of a random 
walk on a Fuchsian building or Coxeter group (or any other non- 
spherical non-affine building or Coxeter group). There are some 
trivial ‘tree-like’ examples, although apart from these no explicit 
formulae are known. Efficient algorithms, or asymptotic formulae 
in the thickness parameter, would also be interesting in lieu of an 
explicit formula. 

(6) Prove a precise and explicit local limit theorem for a non-spherical, 
non-affine building. 

(7) Derive heat kernel and Green function estimates for random walks 
on the chambers of affine buildings (extending [55]). 

(8) Generalise the Brownian motion convergence results of [46] to 
arbitrary type. As a first step one might consider either the rank 2 
cases, or remove the nearest neighbour restriction from [46] for 
walks on Ap buildings. 


A. Isotropic Random Walks on the Vertices of a Coe 
Building 


In this appendix we carry out the details of the outline given in 
Section 3.3.1 in the special case of an affine building of type Cy. These 
calculations were made by the author some years ago, in collaboration 
with Donald Cartwright, following the calculations made in the Ap case 
by Cartwright and Mtotkowski [12]. The calculations here are very much 
‘hands-on’, and do not require as much machinery as the general case. 
See the author’s thesis for some further calculations for Go buildings and 
so called BC'2 buildings. = 

Let © be a building of type C2 with thickness parameters qg = q2 = q 
and q, = r. Let V be the set of all special vertices of &. The root system 
and fundamental coweights @1 and w2 are illustrated in Figure 14.2(b). 
If 1 = kw, + lwg we write Va(r) = Vz i (2). 


Lemma A.1. The cardinalities Ny = |Vx,1(@)| do not depend on 
ze V, and we have 


Net = (+ Dt Dart DEE?) Teno 
Neo = (r+ (gr + Ya(q? rr?) 
Nor= (+ Dart Dero, 


Let us illustrate Lemma A.1 with an examples (the general argument 
is an induction). Figure 14.8 shows part of an apartment of Y. Panels 
with thickness q are shown as solid lines, and panels with thickness r 
are shown as dashed lines. The vertex y is in V1 9(z). 
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Figure 14.8 Computing the cardinalities | Vz )(x)| 


Let A be a chamber of the building containing x. There are q‘r? 
galleries in the building starting at A and ending at a chamber in 
position B, and each of these end chambers contains a vertex in 
V1,2(x). Moreover, every vertex in Vj,2(%) can be reached by such a 
gallery starting at some chamber containing z, and different starting 
chambers result in different end vertices in Vj,2(z). Thus, | Vi1,2(z)| = 
Kq‘r?, where K is the number of chambers containing z. The set 
of chambers containing x is a spherical building of type C2 (that 
is, a generalised quadrangle) with parameters (q,r). Thus, K = 
(qt 1)(r + I(qr + 1), and hence Mig = (q+ 1)(r + I(ar + 1) 
qre. 

For each pair k,! > 0 define an operator Ag acting on functions 
f:V—-Cby 


1 
A = —— 
nO = HE en 


Every isotropic random walk (Xn)n>0 on V has transition operator A 
of the form 


with ag.) > 0 and )°q 130 ak, = 1. Explicitly, ag. = P[Xn41 € Ve1(2) | 
Xn = 2] = p(x, y)/Np,1 for any y € Ve.i(). 


Theorem A.2. The following formulae hold, where in each case the 
indices m,n are required to be large enough to ensure that the indices 
appearing on the right are all at least 0. 
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Aj,0Ao0,1 = Ao,1A1,0 
N1,0Am,nA10 = TAm+in—2 + (GQ — D+ DAmn+ Pr? Amtin 
+ q?rAm—1n42 + Am-1n 
Ni,0A0,nA1,0 = (7 + DAin-2 + (@— D(r + Aon + Pr (r + DALn 
Ni0AmoA1,0 = ar(q + DAm-1,2 + Pr? Am4i0 + (¢— I) Amo + Am-1,0 
Ni 0Am 1410 = Pr? Am4i1 + PrAm-1,3 + Am-11 + (ar +9 —DAm1 
No,1AmnAo,1 = Am,n—1 t+ arAm+i,n—1 + ¢Am-1n41 + 7 rAm,n+t 
No,1A0,n 40,1 = Ao,n-1 + @?rAon41 + ar + DAI n—1 
No,1Am,0A0,1 = (¢+ DAm-1,1 + ar(¢+ DAm,- 


Proof. From the definition of the operators Amn we have 


1 Nu,v 
Nm,nNs,t 


Am nAs,tf(t) = >> | Vinn (2) 1 Vs,2WlF(Y) 


uU 
u,v=0 y 


” ye Vu,v (a) 
(14.14) 


(see [36, (3.1)] for some intermediate steps). The formulae in the theorem 
follow by computing the cardinalities |Vimnjn(z)  Vs4(y)| in the cases 
(s, t) = (1,0) and (s, t) = (0,1). We will give the calculation for (s,t) = 
(0,1) and m,n > 1, leaving the remaining cases as an exercise. 

Suppose that y € Vz, 7(x), with k,/ > 1, and consider the intersection 
Vij(@) M Vo,1(y). It is clear from Figure 14.9 that if this intersection is 
nonempty then 


(4,9) € {(k,t—-1),(&K-1,0+1), (Kk +1,1—-1), (0+ D} 


(these are the four points marked with o in Figure 14.9). Using some basic 
building theory we see that |Vzi-1(7) N Voi(y)l = 1, | Ve-141(@) 9 
Voi! = a, |Veri-1(@) 9 Voy)! = ar, and | Ve 144(2) NO Voiy)| = 
q’r. Thus, for large enough m,n > 1 we have 


Figure 14.9 Computing the intersection cardinalities 
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1 if y © Vinn4i1 (2) 
if y € Vintiyn—1(2) 
if y © Vmn-1,n41(2) 

qr if y € Vinjn—1(2). 


|Vimnjn(x) a Vo,1(y)| = 


It follows from (14.14) and Lemma A.1 that 


1 
Am,nAo,1 sS Wy P TAmnt + qrAm+1,n-1 Sa gAm—1,n41 oF Am,n-1)- 
The remaining formulae are similar, with some care for small values of 
m and n. 


Let & be the linear span of {Am,n | m,n > 0} over C. 


Lemma A.3. The vector space & is a commutative unital algebra over 
C, generated by Ai, and Ag,,. Moreover, & is isomorphic to C[X, Y| 
(the algebra of polynomials in commuting indeterminates X and Y), with 
an isomorphism given by X + Ato and Yr Ag. 


Proof. Let ~ be the total order on N2 given by (k, 1) ~ (m,n) if either 
k+l <m+nork+l= m+n and k < m. An induction using 
this total order and the formulae in Theorem A.2 shows that for each 
(m,n) € N? and (k,l) € N? the product Am,n Ak, is a linear combination 
of terms A;,; with (i,j) € N?. Thus, o is a unital algebra (with unit 
Ao,o = I). Moreover, for each (m,n) € N? an induction shows that there 
is a positive number Cm,» > 0 such that 


Am,n = Cm nA oAG 1 + a linear combination of AF Ab 1 
with (k,l) < (m,n). (14.15) 
Thus, & is generated by A1,9 and Ag, and hence is commutative as 
Aj,0A0,1 = Ao,1 A1,0- 

It follows that there is a surjective homomorphism yw: C[X, Y] — # 
with w(X) = Ajo and W(Y) = Ao. Suppose that z = Ya, X*Y! € 
ker(wW) is nonzero, and let (m,n) € N? be maximal subject to mn # 0. 
Then (14.15) implies that 

0 = (2) = hn Am,n + linear combination of terms Ag) 
with (k, 1) < (m,n) 


for some Cri # O, contradicting the linear independence of the 
operators Az j. Thus, y is injective, and so 7 =C[X, Y]. 


Let Cp be the group of signed permutations on two letters, acting on 
pairs of nonzero complex numbers (21, 22) by permutations and inversions 
(for example, there is an element o € C2 with o(21, 22) = (ao z1)). This 
group of order 8 is the Weyl group of type C. 
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Theorem A.4. For each pair (21,22) of nonzero complezr numbers 
there is a one-dimensional representation Tz, 2, of M given by 


(qr) (q/r)—” 
d4+qH)ad4¢r-Has qir-}) 


Tz 22 (Am,n) _ 


> C(Z@ (1); %(2))* (1) % (2) 

oEC2 

where 

(—q tata )a-_qlata)a—rtey 4d — rte?) 
d—z'2a)da-z'!a)a-4%0d-%”°) 


c(41, 2) = 


whenever 21, 22, a a are pairwise distinct, and if 21, 22, ae, a5 are 


not pairwise distinct then the formula for z,,2.(Am,n) 1s obtained from 
the above formula by taking an appropriate limit. Moreover every one- 
dimensional representation n of & is of the form m = 12,2 for some 
2,22 € C%, and ry» = Toh oh if and only if (2,2) = o(4, 22) for 
somea € Cy. 


Proof. By Lemma A.3 we have #@ = C[X, Y]. The one-dimensional 
representations of C[X, Y] are precisely the evaluation maps X # u 
and Y + v, and thus for each (u,v) € C? there is a unique one- 
dimensional representation 2“) of </ determined by 1%) (Ay 9) =u 
and m4) (Ag 1) = v, and all one-dimensional representations are of this 
form. 

Let (u,v) € C? and write amin = (gr)™(qV/r)"t%” (Amn). Let 
u = (qr)-! Nou and v’ = (q./7r)~1 Nov. Applying z to the formulae 
in Theorem A.2 gives: 


Mk Capi = A@m+1,n—2 + ad-— qa a a + Am+1,n 


+ Am-1,n+2 + @m—1,n (14.16) 
u' don = (+r) (a1n-2 + 1 — gan + a1,n) (14.17) 
u' Am,0 =(1+ q!)am—1,2 + Gm+i,o + ( — qr amo + A@m-1,0 
(14.18) 
u'dm,1 = Om+1,1 + Im-1,3 + Im-1,1 + (1+ rt gr aa 
(14.19) 
VU Omn = Omn—1 + Gm4ijn—1 + Gm—-1,n41 + Omyn41 (14.20) 
v'a0,n = 0,n-1 + dong. ++ r7")atn—1 (14.21) 
v' amo = (1 + 7! )(am—1,1 + am,1), (14.22) 


where in each case the indices m,n are required to be large enough to 
ensure that the indices appearing on the right are all at least 0. From 
(14.20) we have 


’ 
@m+i1jn—1 + @m—1,n+1 = V Am,n — m,n—1 — Gm, n+1 for all m,n >= 1, 


JAMES PARKINSON 434 


and using this equation in (14.16) gives 


Tees =(1- qd + ON a ei a7 (Qm+41,n—2 a @m—1,n) 
F (Qm+1,n =F Am—1,n+2) 
=(- gyal ae po aie + (v'amjn—1 — am,n—-2 — am,n) 
a Cg ners — am,n — Am ,n+2)s 
valid for all m > 1 and n > 2. A similar calculation using (14.21) and 


(14.17) shows that the above formula also holds for m = 0. By replacing 
n by n+ 2 and rearranging we obtain 


/ / 
Am nt4 — V Amn+3 + COmn+2— VU Amnt1+ mn =O0 for all m,n >= 0, 


(14.23) 


where a = 24+ u/—(1—q7!)(14+1r7!). The auxiliary equation of this 
linear recurrence (in n) factorises as a product of two quadratics: 44 — 
VAR + aa? — vA 41 = (A2 — aA +1)? — 0A +1), and so the roots of 
the auxiliary equation are of the form 2, 2, A, 22, 2 ! for some numbers 
21, 22 € C*. By Newton’s identities we have 


von x - (a ~—gh)d¢rh+(atea Det %')) (14.24) 
qJ/r ( =< =) 

v= ytz- tat+z : 14.25 
ce 1 9 ( ) 


Writing 1x 2 = %”) whenever (z1, 22) € (CX)? and (u,v) € C? are 
related as above, it follows from (14.24) and (14.25) that 2,2. = 2 
if and only if (21, 25) = (20(1), %(2)) for some o € Cy. We now verify that 
x ,29(Am,n) is given by the formula in the statement of the theorem. 

Assuming for now that 2, 2 e; 22, 2a ! are pairwise distinct, solving 
the recurrence (14.23) gives 


Om, n=C1,m (21, 22)2] + Com (21, 22) 25 + C3, m (21, 22)2, "+ Cam (21, 22) 2” 
1 2 


for suitable functions Cj,m(z1,22) (independent of n). Writing 
Cima, 2) = Cm(“, 22), the invariance under the group C2 implies 
that 


Ca m(a, 22) = Cm(2z2, 21), C3,m(21, 2) = Cm(z,', 5), 


Chm (A, 22) = CmnlZa eae 
and also that Cm(<z1, ae) = Cy, (21, 22). Thus, for all m,n > 0 we have 


Omin = Cm(2, 2)27 + Cm (22, 2) 28 + COm(z*, Da” 
Cie ee (14.26) 
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Writing Cm = Cm(21, 22) it follows from (14.26) and (14.20) that 
4 Cm+2 — (2+ %)Cmtit2Cm=0 forallm>0 (14.27) 


we have used the fact that v’ = 2 + gt zo + z,'). The roots of 
1 2 


the auxiliary equation of the recurrence (14.27) are 2122 and 22, 5 and 
these are distinct by hypothesis, and hence 


Cm = D(a, 22)2{" 25" + D'(a, za)2{" 2%" for all m > 0 


for suitable functions D(z, z2) and D’(z1, z2) independent of m. Since 
Cm (21, 22) = Cis”) we have D’(z1, 22) = D(z, 5), and thus by 
(14.26) we have 


ann = BS DZ); 2 (2) 254) 22) for all m,n > 0. 
oECo 


To compute D(z1, 22) we proceed as follows: Using the recurrence 
formulae we obtain explicit formulae for ao, 40,1, 49,2 and ap,3 in terms 
of z1 and zg. In particular, a9,9 = 1, and aj,9 = gru and ag = qJ/rv 
are given by (14.24) and (14.25). Then (14.21) with n = 1 gives 
a2 = v'agi—-1—(1+r aio. By (14.21) with n = 2 we have 
a0,3 = v'a9,2 — 29,1 —- + rYara, and aj,1 is computed using (14.22) 
with m = 1 giving ayy = v'(1+ q')-ta10 —ao,1. This gives the initial 
conditions of the recurrence (14.23) with m = 0, and thus the coefficients 
in (14.26) (with m = 0) can be computed, giving 


(1 — qigie ha - q iz lw) - gr ee 
(t+¢-h)+ qtr-h — eta A — ep) — 277) 


Co(21, 22) = 


From (14.21) and (14.26) we see that C1(21, 22) = see Cp, and thus 
the initial conditions of the recurrence (14.27) are known. Thus, we can 
solve for D(z, 22), and we find that D(z, 22) = c(1, 22)/(+ gq ya+t 
r-ya+ qtr), with c(z, 22) as in the statement of the theorem (a 
computer algebra package is recommended for these calculations). 
Thus, the formula for zz, z. is verified in the case where 21, 22, 2, eS oe 

are pairwise distinct. Generally, from (14.24), (14.25), and the fact that 
Aj,o and Ag generate & we see that (Amn) is a polynomial in 
21, ae 22, ae Thus, in the case where 21, ae 22, eZ are not pairwise 
distinct we can obtain the formula for mz,.2.(Amjn) by taking an 
appropriate limit. 


Let T = {t € C | [t| = 1}, and let dt denote normalised Haar 
measure on T. Thus, for integrable functions f on T we have J, f(t) dt = 
se fe" fe!) do. If A € &f and t € T? we write A(t) = 14(A). The 
following theorem establishes the Plancherel formula for the algebra /. 
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Theorem A.5. We have 


1 oe oa 
wl Ag bH)Amn(b) du(t) = 5(k,1),(m,n)> 
k,l JT2 


where du(t) is the measure 


1 
du(z)=K sdtidtz, with K = —A+q7!)dtr)d+q'r}). 
|c(21, 22)| 8 
: (Co) (0,0) 
Proof. Since AgjAmn = Paes Ck), (rn) Ais and C(e.2),(mjn) = 


5(k,1) (m,n) Nel it suffices to show that Spe Ax, it) du(t) = 5(k,1),(0,0)- 
Using the facts that |c(t1, t2)|? = c(tec, isa) ete ays a) for (t1, tz) € 
? and o € Cy, and that fy f(t7') dt = fp f(t) dt we have 


peel l 


t 
i Ay (t)du(t) = 5a "qvry ye ps eee ay, ity 


Gy Cte cay» 5a) 


= (gr) * (avr)! i, ( : es 
T\JT c(t, ‘. is) 


As a contour integral, the inner integral is 


fhe 1 atte day 
Sigaig “la ae I 
Melty je 2nt Jr c(zy*,ty') 4 
where I is the unit circle traversed once counterclockwise. The poles of 
the function f(z) = 1/e(z,', i") are at 71 = ites qt2,+./r and since 


|to2| = 1 and q,r > 1 we see that f(z,) has no poles inside the contour I’, 
and so by residue calculus we deduce that 


k+l 2 
tt. 1 1—t. 
1 4 D 
~*~ dty = 8k. 0,0 pet ay ee 0.0) <> - 
L e416) NE 0-day Pd See 
Thus, 
x -# 
i Ag i(t) du(t) = 50,0.00) f a dia = 8(&,),0,0)- 
T2 tl-r vty 


Theorem A.6. Let (Xn)n>0 be an isotropic random walk on X with 
transition operator A. Then 


1 
pa) = ac AOA det) ify € Veale). 
k,l 


Proof. We have A” = }); 530 4; A, ij Where p\™(z,y) = a Ni, 
whenever y € Vj,j(z). Thus, ee Theorem A.5 we have, for any pait x i 
with y € Vz 1(2), 
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be A(t)” Ap,(t) du(t) = Y> a," “ae Aig An duct) 


4,j20 


= Neag) = NEP (a, y)- 


The asymptotics for the n-step transition probabilities of the random 
walk (that is, the local limit theorem) can be extracted in a standard way 
from Theorem A.6. Let us simply illustrate this in an example. Consider 
the ‘simple random walk’ on V with transition operator Ao,i (this is the 
random walk such that if X, = y in Figure 14.9, then X41 is one of 
the Noi = (¢+ 1)(¢r + 1) vertices marked with o in the figure, each 
chosen with equal probability 1/No,1). We can compute Ao, (t) from the 
general formula in Theorem A.4, however a shortcut is given by (14.25), 
giving 
qT 


HOM ee gre 


(ott +0+4"). 


The simple random walk is periodic, with period 2, and so we consider 
p?2” (x, x). Writing et = (e1, e*2) we have 


2q./7r (cos 61 + cos 62) 


ma 10\ __ 
AQES = ae Gea) 


and 


1 — 46? — 63)?6?63 
eee Gas ar 


+ Ola’). 


Let p = 4q/r/(q + D(gr +1) and K’ = K/(7?- q')4Q— 14)». 
Some standard tricks from asymptotic analysis now give 


Ao,1(e"*) 
(2n) on do 
p (a, 2) = i =. ye Te(ei®)]2 d0; dg 


0 0 
K! ~ Kip a Ga (6? — 03)0263 dO, do. 
—e TC, 


61 + cosé 
~ 2K! ee i (coertooseny (02 — 03)76?203 dO, dé 
—e J-€ 


ao fe [ ‘(see + TE) 
2 


(9? = ys)? gigs dg dg2 


2K’ PO eS 
oe pen f : e (91 +92)/2 (2 = ys) 293 dg, dg2, 
n —0o J—0O 
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and thus in conclusion we have 


~ Sa Dees ear Neer? ( Aqa/r )" a 


(2n) 
p-™ (x, 2) a(q—1)4(r —1)4 (q+ (ar + 1) 


B. Rank 2 Hecke Algebras and the Feit-Higman 
Theorem 


In this appendix we present the representation theory of rank 2 spherical 
Hecke algebras and apply it to Theorems 3.8 and 3.10. As a byproduct 
we arrive at a proof of the Feit-Higman Theorem (this proof is due to 
Kilmoyer and Solomon [31]). Let (W,S') be the Coxeter system of type 
In(m). Write S = {s1, s2}, and write gs, = q and gs, = r (with q = r if 
m is odd). Let T; = Ts, for 1 = 1,2 be the generators of the abstract 
Hecke algebra #. The classification of the irreducible representations of 
H is elementary: 


Proposition B.1. [20, Theorem 8.3.1] In the above notation: 


(1) If m is odd then the complete list of irreducible representations of 
H is as follows: There are precisely 2 one-dimensional irreducible 
representations, given by 


Ptriv(11) = 1 , feats ae 


an i 
Ptriv(T2) = 1 Psgn(T2) = —q7 
and precisely (m — 1)/2 2-dimensional representations, given by 


1/-1 0 1l[q ¢. 
((T)) = - and (12 =<| i] 
pj(T1) ap | p5(T2) alo = 
forl <j < (m—1)/2, where cj and Cj are any numbers satisfying 
C5 c = 4q cos?(1j/m). 
(2) If m is even then the complete list of irreducible representation of 


KH is as follows: There are precisely 4 1-dimensional representa- 
tions, given by 


Ptriv(T1) = 1 Ptriv(T2) = 1 
Psen(T1)=—G' —— Psgn(T2) = 
p(T) =1 p'( Ta) = —r 
p(Ti)=—q* p>(T) = 1. 

There are exactly (m — 2)/2 two-dimensional representations, 


given by 


1/-1 0 lfr ¢ 
; — ra 7 = J 
p(T) = = I ‘l and pj(T2) = = lo 5 


BUILDINGS, GROUPS OF LIE TYPE AND RANDOM WALKS 439 


forl <j < (m—2)/2, where cj and Cj are any numbers satisfying 
Cj =qtr+2/q cos(2nj/m). 


Proof. Itisastraightforward exercise to show that the claimed formulae 
produce representations (by checking that the defining relations (14.7) 
are satisfied). It is also easy to check that the representations are 
irreducible. To check that they are pairwise non-isomorphic one can 
compute the inner products and check (14.9), and finally to check we 
have all irreducible representations one uses the character theoretic fact 
that > dim(p)? = |fo(m)| = 2m, where the sum is over all irreducible 
representations of #. See [20, Theorem 8.3.1] for details. 


The representation theory described in Proposition B.1 allows us to 
be extremely precise for random walks on generalised polygons. Let us 
simply illustrate these arguments for the case of generalised quadrangles 
(that is, m = 4): 


Corollary B.2. For the simple random walk on a generalised quad- 
rangle with parameters (q,1r) we have 


1+ kA} + head + kag + ka (at +17) 
(¢+D(r+D(qr+) 


p™ (0, 0) = 
and 
1 
Ne — ulley < > (HAR + bodZ™ + byaZ" + byQQ" +229)), 


where the numbers kj, Aj, and As are given by ky = q?r?, ko = r2(qr+ 
D/(q+r), ke = Cr+D/G+r), ke = ar(a+Dr+)/G+), 
Ar = -2/(q+7r), 42 = (@- D/G+7), A3 = (r-1D/(q+ 1), and 
Ag = (Q+r—-24Vq—r)? +4(q+7))/2G +7). 


Proof. For the first statement, from Theorem 3.8 we have 


p(0,0)= aa YS axe”), 
[A pelrrep(#) 

where T = a Ty + aa T2 (since we are considering the simple random 
walk). We have |A] = (¢+1(r+1)(qr+1), and by Proposition B.1 there 
are five irreducible representations of # with respective multiplicities 
Mrivy = 1, Megn = Q?r?, m! = r?(qr+1)/(q+r), m? = @(qr+1)/(q+n), 
and m = qr(¢qt+ (r+ 1)/(¢q +7) (computed using Theorem 3.7). We 
have Xtriv(I”) = 1, Xsen(T") = (-2/(q +1)", x'(2") = (q— D/(at+ 
ry)", x°(T) = (r-D/(q+r))", and a calculation of eigenvalues gives 
xu(T”) = 10% +A". The result follows, and the second statement follows 
similarly from Theorem 3.10, noting that T* = T. 
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Figure 14.10 Irreducible spherical Coxeter systems 


By considering the formulae for the multiplicities of irreducible 
representations in the geometric representation we obtain a proof of 
the Feit-Higman theorem and some of the divisibility conditions from 
Theorem 2.12 (c.f. [31]): 


Proof of the Feit-Higman Theorem and Divisibility Conditions. Supp- 
ose that a finite thick generalised m-gon exists with parameters (q, 1). 
Let x; be the character of the representation p; from Proposition B.1. 
By Theorem 3.7 we have (x;,x;) € Q. On the other hand we can 
explicitly compute these inner products. Writing 0; = 27j/m, a tedious 
calculation gives 


2 
2m + ao if m is odd 
IAN X55 Xj) = nee ee 
Sak: (r(q—-1)?+q(r-1)?)m + (q-1)(r—1)m cos 6; it are Aieuety, 


2gr sin” 0; Ja sin? 0; 
If m is odd, then the formulae force cos6; to be rational, and this 
implies that m = 3. If m is even, then the above formulae imply 
that both sin?(22/m) and cos(2x/m) are rational (consider (x1, x1) + 
(X(m/2)—j» X(m/2)—7) to see that sin? (27 /m) is rational). Together these 
facts imply that m € {2, 4,6, 8}. 

The divisibility conditions for the cases m = 4,6,8 follow by 
computing the multiplicity of p! using Theorem 3.7. The facts that 
J@ € Zand ./2qr € Z for hexagons and octagons (respectively) arise 
from the multiplicity of 1. 


C. Spherical and Affine Coxeter Systems 


In this final appendix we list the Coxeter diagrams of the irreducible 
spherical and affine Coxeter systems. For the affine systems, the extra 
generator that is added to the spherical system is indicated by o. 


BUILDINGS, GROUPS OF LIE TYPE AND RANDOM WALKS 441 


2 oe e% ey e--e gs 


An, % > 2 Bn, n = 3 Cn, n > 2 
—_ o—o | oe o—e—e | Q—_o—_9 
Dn >4 Ee Ex 
_l. eo—o o—e ety ® oO oS, 
i Fy Go 


Figure 14.11 Irreducible affine Coxeter systems 
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1. Introduction 


This work is concerned with questions related to a number of recent 
studies where “stable-like” processes and random walks are considered. 
We focus on random walks on groups, mostly nilpotent groups and 
groups of polynomial volume growth, associated with various types 
of spread-out probability measures. Here, “spread out” is used in a 
nontechnical sense to convey the idea that these measures do not have 
finite support. We will be interested in two main cases. In the first case, 
the measure is spread out in a radial way with respect to a fixed word 
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length on G. In the second case, the measure is spread out along the 
one parameter groups {s;" : m € Z} corresponding to a given finite list 
of generators (s1,..., 5%). In the second case, the measures are “more 
singular” and the study of their convolution powers is more delicate. 
Given a (symmetric) probability measure v on a (finitely generated) 
group G, we consider the discrete time random walk (X;)¢° driven by 
v and started at Xo = e. This means that X, = &...En, n => 1, where 
(€){° is an i.i.d. sequence of G-random variables with common law v. We 
denote by P, the law of the sequence (Xn) with Xo = g. The distribution 
of Xp, itself is the convolution power v‘”). We also consider the associated 
continuous-time random walk X; whose distribution is given by 


[oe tr 
pe(g) =e!) (9). 
— nl 


This continuous-time process will serve as a tool in the study of the 
discrete random walk driven by v, a technique that has been used by 
many authors before. 

The question addressed in the present work is the following. Assuming 
good upper bounds on v"")(e), under which circumstances can one prove 
matching lower bounds? Further, can one describe (in a certain sense) the 
region in G where, for a given n, v'™(g) ~ v™ (e)? We provide answers 
for measures v that are quite natural and for which well-understood 
existing techniques are insufficient and/or need to be modified. 


1.1. Main Definitions 


Definition 1.1. We say that || - || : G > [0,co) is a norm on G if 
|g|| = 0 if and only if g = e and, for all g,h € G, ||ghl| < |lg|| + |]A]| and 
lIgil = lg~* I. Given a norm ||-||, we say that V(r) = #{g € G: Ig <r} 
is the associated volume function. 


The simplest and most common example is of a norm provided by the 
word length associated to a given finite symmetric set of generators. We 
will encounter other norms as well. 

The properties studied in this work are the following. 


Definition 1.2. Let ~ be asymmetric probability measure on a group 
G. Let || - || be a norm with volume function V. Let r : (0,00) > 
(0,00), t+ r(t), be a nondecreasing function. Let (Xn)¢° be the random 
walk on G driven by jw. We say that y is (||-||, 7) controlled if the following 
properties are satisfied: 


1. For all n, w@™(e) ~ V(r(n))7!. 
2. For all e« > 0 there exists y € (0,00) such that 


O<k<n 


P.( sup {|| Xzll} = yr) <€. 
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The first of these two properties is rather straightforward and self- 
explanatory. It provides a two-sided estimate for the probability of return 
of the random walk. In more general contexts, this property is also known 
as a two-sided “on-diagonal” bound. The second property is related to 
the first insofar as it actually easily implies the lower bound w?™ (e) > 
V(cr(n))~!. It also provides a weak control of the behavior of p‘”) (g) 
away from the neutral element e. 


Definition 1.3. Let 4 be asymmetric probability measure on a group 
G. Let || - || be a norm with volume function V. Let r : (0,00) > 
(0,00), t + r(#), be an increasing continuous function with inverse p. 
Let (Xn)¢° denote the random walk on G driven by jz. We say that pz is 
strongly (|| - ||, 7) controlled if the following properties are satisfied: 


1. There exists C' € (0,00) and, for any x > 0, there exists c(k) > 0 
such that, for all n > 1 and g with ||g|| < «r(n), 


e(k) V(r(n)))+ < weg) < CV(r(n)). 


2. There exists €,y1, v2 € (0,00), y2 => 1, such that, for all n,t such 
that 5e(t/v1) <n < p(t/v1) 


inf {p. ( sup {||Xzl} S yet; ||Anll Ss ‘| ze. (15.1) 


x:\|z\|<t O<k<n 


Strong control implies the following useful estimate. The last section 
of this chapter gives an application of this estimate to random walks on 
wreath products. 


Proposition 1.4. Assume that r is continuous, increasing with inverse 
p and that the symmetric probability measure ww is strongly (|| - ll, 7) 
controlled. Then, for any n and t such that yir(2n) => Tt, we have 


x: ||e\|<t O<k<n 


inf {P.( sup {Il Xgll} < 273 || Xnll < ‘| see ren eGM), (152) 


Proof. By induction on € > 1 such that 1 < @ < 2n/p(t/y1) < (€+ 1), 
we are going to prove that 


inf {P. ( sup {||Xxl} S vat; |Anll < ‘| ae, 


L:\|z\|<t O<k<n 


This easily yields the desired result. For € = 1, the inequality follows 
from the strong control assumption. Assume the property holds for some 
£ > 1. Let n,t be such that (€+ 1) < 2n/p(t/y1) < (€ + 2). Choose n’ 
such that n — n’ = [e(t/y1)/2] and note that 2n’ € [1,(€+ De(t/y1)). 
Write Z, = supzen {|| X~||} and, for any x such that |||] < 7, 
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Py (Zp < yot; ||Xnl| < 7) 


= Pe (Zn S y2t3 (|Xn'll < 5 Aull < 7) 


n'<k<n 


> Pz ( S yat WXnill st; sup {I|AXgll} S yet; Anil S ‘) 


= Ex (Uz,,<ytXylse}PX, (Zn—n’ < 727 Xn—n’ll < t)) 
S<eP5(Zy Seri Xa on Se. 


This gives the desired property for €+ 1. 


Example 1.5. Suppose wu is strongly (|| - ||, 7) controlled with r(t) = 
t!/8 for some fixed f € (0, 2]. Then the probability that the random walk 
driven by yu started at the identity element e exits the ball {g : ||g|| < tT} 
before time n is bounded below by e~¢¢+"/?"), 


1.2. Word-Length Radial Measures 
Let the group G be equipped with a generating k-tuple 


S = (81,.-+; 8) 


and the associated finite symmetric set of generators S = (Cae saheg’S aa 
Let |g| be the associated word length, that is, the minimal & such that 
g=u...ug with ue S,1<i<k. By definition, the identity element 
e has length 0. Hence, |-| isa norm and (x, y) > |x~!y| is a left-invariant 
distance function on G. Let 


Vs(r) = #{G :l|gl <r} 


be the volume of the ball of radius r. We say that G has polynomial 
volume growth of degree D if Vg(r) ~ r? in the sense that the ratio 
Vs(r)/r? is bounded away from 0 and oo for r > 1. Finitely generated 
nilpotent groups have polynomial volume growth and, by Gromov’s 
theorem, any finitely generated group with polynomial volume growth 
contains a nilpotent subgroup of finite index. More precisely, any finitely 
generated group G such that there exist constants C’, A and a sequence 
np with V(ng) < Cn contains a nilpotent subgroup of finite index and 
thus has polynomial volume growth of degree D for some integer D. See, 
e.g., [8]. 


Example 1.6. Let G be equipped with a word-length function | - | 
associated with a symmetric finite generating subset. Assume that G 
has polynomial volume growth. The main results of [10] imply that, for 
any symmetric probability measure j with finite generating support, ju 
is strongly (|-|,t t+ V#) controlled. The main results of [1, 2] show 
that, if vg is symmetric and satisfies vg(g) ~ [(1 + |g|)* Vig)! with 
B € (0,2), then vg is strongly (| -|, ft t1/8) controlled. See also [3, 14]. 
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One example that motivates the present work is the case of the 


measure 
Cc 


(+ lg)2Vdgl) 


Can one provide good estimates for vs (e) on groups of polynomial 
volume growth? The following theorem gives a very satisfactory answer 
to this question and covers not only this particular example but the full 
range of cases passing through the classical threshold corresponding to 
the second moment condition. 


vo(g) = 


Theorem 1.7. Let G be equipped with a word-length function | - | 
associated with a symmetric finite generating subset. Let V be the 
associated volume function and assume that G has polynomial volume 
growth. Let @ : [0,co) > [1,co) be a continuous regularly varying 
function of positive index. Let r be the inverse function of 


© sds 
t ey f=. 
EET ei) 


Let vg be a symmetric probability measure such that 


vo (9) ~ (15.3) 


1 
dg) Vgl 
Then vg is strongly (|-|,7r) controlled. 


Example 1.8. Assume that ¢(t) = (1 + t)%e(t) with & positive 
continuous and slowly varying (we refer the reader to [4, Chap. I] for the 
definition and basic properties of slowly and regularly varying functions. 
The scaling function r of Theorem 1.7 can be described more explicitly 
as follows. 


e If B>2,r(t)~ tl. 
e If B < 2, we have a a ~ cg @(t) and r is essentially the 
inverse of ¢, namely, 


r(t) = tV/8 Cage (21/?) 


where €4 is the de Bruijn conjugate of £. See [4, Prop. 1.5.15]. For 
example, if £ has the property that €(t%) ~ ¢€(¢) for all a > 0 then 
Car age: 

e The case 6 = 2 is more subtle and the proof is more difficult. The 


t ; : 
function yp: tr fo se is slowly varying and satisfies y(t) => Tay: 


For example, if 2 = 1, we have w(t) ~ logt and r(t) ~ (tlogt)!/?. 
When ¢(¢) = (log t)” with y € R then 

—Ify>1, W(t) ~ 1and r(t) ~~ t!/?; 

— If y =1, W(t) ~ loglogt and r(t) ~ (tloglog t)/?; 

—Ify <1, w(t) ~ dogt)!” and r(t) ~ (tog t)!-”)1/?. 
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1.3. Measures Supported by the Powers of the Generators 


In the critical case when @ is regularly varying of index 2 and vg 
has infinite second moment (i.e., >> Ig\?vg (g) = oo), the proof of 
Theorem 1.7 makes essential use of some of the results from [18], which 
are related to variations on the following class of examples. Recall that G 
is equipped with the generating k-tuple S = (s1,...,s,). For any k-tuple 
a= (Qj,...,@%) € (0, 00)*, and consider the probability measure 1¢,¢ 
supported on the powers of the generators s1,..., 5, and defined by 


ee _ 1 
mE 


Set 


a; = min{a;,2} and a, = max{a;,1 <7 < k}. 


Define 
ue ~ 
llgs=mingr:g=[[sf:g=+1, #U:y =a <r 
j=l 
(15.5) 
Note that g > |lglls,a : G > [0, 00) is a norm. Consider also the measure 
c(G, a, B) 
Vs ,a,8(9) = (15.6) 


(1+ Ilglis,a)? Vs,a(llgils,a) 
with B € (0, 2). 

Under the assumption that G is nilpotent and {s; : a; € (0,2)} 
generates a subgroup of finite index in G, it is proved in [18] that there 
exists a positive real Dg,q such that 


Qs,a(r) = #Allglis,a < r/°*} = P82 
and 
ue) < Cg gnPs.4, vg, g(e) < Cga,pnPs.2/8 | 


Here we prove the following complementary result. 


Theorem 1.9. Let G be a finitely generated nilpotent group equipped 
with a generating k-tuple S = (s1,..., 5%). Referring to the notation 
introduced above, fir a € (0, 00)* and assume that {s; : a; € (0,2)} 
generates a subgroup of finite index in G. 


e The probability measure wg,q 18 strongly (| - llsja,t te t1/e) 
controlled. 
e For any B < (0,2), vg.a,p is strongly (\|- lls,a, tt t!/B) controlled. 
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Remark 1.10. In [18], a detailed analysis of the subadditive function 
ll - Ils,a and the associated geometry is given. This analysis is key to 
the above result and to its proper understanding. For instance, it is 
important to understand that the parameter a, is not necessarily a 
significant parameter. It is the quantity || - II's. q that is the important 
expression. Indeed, for any given nilpotent group G, [18] describes 
conditions on two pairs of tuples (S, a), (”, a’), 


S=(si)F € G*, a =(ai)h € 0,00)", 8’ = (8) € Ga! = (a) €(0, 00)", 


such that ||- "Sa ~ |l- Siar Because the geometry ||gll3,¢ is studied and 
described rather explicitly in [18], the above results give rather concrete 
controls of the random walks driven with wg,q or vs,a,p- 


On the one hand, in the case of the measures vg,q,g and with much 
more work, it is possible to improve upon the statement of Theorem 1.9 
and obtain a full two-sided pointwise bound on vg \q,g. Indeed, based on 
the results of [18], it is proved in [14] that, for all g € G and n> 1, 


5,a,p\9) ~ ; ~ min : 
oP (WB + IIglls,a)*O80¥8 ere aire « 


On the other hand, in the case of the measures g,,, Theorem 1.9 
provides the most detailed result available at this time. Indeed, available 
techniques do not seem to be adequate to provide a sharp two-sided 
bound for w3,(9) and finding such a two-sided bound is an interesting 


open problem. 


1.4. A Short Guide 


Section 2 is based on well-known variations of the celebrated Davies off- 
diagonal upper-bound technique. Our key observation is that, even in 
cases where we do not expect to obtain full off-diagonal upper bounds, 
the Davies technique provides enough information to prove control in the 
sense of Definition 1.2. 

Section 3 describes the notion of pointwise pseudo-Poincaré inequality 
(a variation on the idea introduced in [6]) and shows how, with the help 
of the underlying group structure, a pseudo-Poincaré inequality allows 
us to upgrade control to strong control. 

Section 4 applies the earlier results to a family of probability measures 
and random walks introduced in [18]. These measures are supported on 
the powers of the given generators. They provide examples for which no 
good off-diagonal upper bounds are known at this time. Nevertheless, 
the results developed here apply and capture useful properties of the 
associated random walks. 
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Section 5 is concerned with radial-type measures where radial refers 
to a given norm on the group G. The simplest and most interesting 
case is when this norm is taken to be the usual word length associated 
with a finite symmetric set of generators and, in this case, we prove 
Theorem 1.7. 

Section 6 describes the applications to a class of random walks on 
wreath products. The notion of strong control (on the base group of the 
wreath product) leads to lower bounds on the probability of return on 
the wreath product. 


2. Davies Method, Tightness, and Control 


2.1. Davies Method for the Truncated Process 


In this section, we review how Davies’s method applies to the continuous- 
time process associated with truncated jump kernels. We follow [13, 
Section 5] rather closely even so our setup is somewhat different. The 
first paper treating jump kernels by Davies method is [5]. 

Throughout this section G is a discrete group equipped with its 
counting measure. Fix a norm g +> ||g|| with volume function V and 
set d(x, y) = ||c~'y||. Note that d is a distance function on G. Consider 
the left-invariant symmetric jump kernel 


I(x, y) = v(a*y) 
associated to a given symmetric probability measure v on G. For R > 0, 
define 
Sk= >> v(z)and G(R)= > aI? va), (15.7) 
Iz|>R Iz sk 


JIn(z,y) = J(z, yNaa,y<ry Ip y) = J(z, y)laqe,yy>R) and vp = 
Ip(e,"). 

Denote by p(t,z,y) and pr(t,z,y) the transition densities of the 
continuous-time processes associated to J and Jp, respectively. In 
particular, 


[o,e) 
2 = ” _ 
p(t, x,y) = p(x ly)=e ae ly). 
0 nN: 


Let 
1 
EFA =O) = 5 EM —-fyyrI@,y) (15.8) 


xy 


be the corresponding Dirichlet form and set also 


1 
Er.) = 5) 0@) —f@)" Ir@, 9). 


zy 
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Note that 
1 
EG N-ExGN=5 De i@O-fMP Jey 
x,y:d(x,y)>R 


< Do F@? +f) I, y) < IG SR. 


z,y:d(x,y)>R 
Consider the on-diagonal upper bound given by 
VireG,t>0, pile) < md), (15.9) 


where m : [0,00) > [0, 00) is a continuous, regularly varying function of 
negative index at infinity and m(Q) < oo. Since the function t Rh m(t) 
may present a slowly varying factor, we follow [13]. The starting point 
is the log-Sobolev inequality 


>of log < eEr(f, f) + 2e6p +log me) IIfIl5 + IIFll5 log Iiflla (15.10) 


with « > 0 which follows from (15.9) by [7, Theorem 2.2.3]. The following 
technical proposition is the key to most of the results obtained in later 
sections. The logarithmic Sobolev inequality (15.10) is one of the main 
tools used to prove this proposition, which is essentially a corollary of 
[13, Section 5.1]. 


Proposition 2.1. Assume that the on-diagonal upper bound (15.9) 
holds with m regularly varying of negative index. Then there is a constant 
C' such that, for all R,t > 0 anda € G we have 


ee f  \ lall/aR 


Remark 2.2. The bound in this proposition is better than the uniform 
bound ppr(t, e, x) < Ce Rt m(t) only when t < R?/G(R). 


Proof. It suffices to consider the case t < R?/G(R). Starting with 
(15.10), we apply Davies method, as described in [13, Section 5.1] to 
estimate pr(t, e, x). Let 
ARG)? = max {e“™*PR(e%, eM Vllos le™ Fae, loo | 
with 
Pa, W(@) = lve) — v7 JRC, 9), 
y 


then by [13, Corollary 5.3], for any w with finite support, 


prt, £,y) < C(t) exp (45nt + 72A R(t — vy) + W@)). 
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Consider the case x = 2 and y = e. Fora > 0, set w(z) = A(lzoll—llzI) 7 
and write 


z 
e MOP Re, (2) = Yo (VO-¥ ~1) Inte, 9) 


IA 


eR YW — VY TREY) 
y 


<r eF S© yl? vy < Re? F G(R). 
lyl<R 


The last inequality uses the elementary inequality t?e~! < 1 with t = 
Rx. By inspection, the same estimate holds for e2V@P p(e-¥, e~Y)(z). 
As w(e) =A |laoll, we obtain 


Patt, €,%) = pr(t, a, €) < Cm(t) exp(15gt + 72R-*e?G(R)—A lanl). 


Since t < R?/G(R), we can set 


i R? 
— lo 
3R ° tG(R) 


so that the second term 72tR~2e?*"G(R) is a constant. This yields the 
stated upper bound. 


2.2. Control 


Meyer’s construction is a useful technique to construct the process X;5 
by adding big jumps to X2. See, e.g., [12] and [2, Lemma 3.1]. In this 
section, we combine the off-diagonal upper bound in Proposition 2.1 
with Meyer’s construction to derive control-type results for the process 
with jump kernel J. Our goal is to show that, for a certain choice of 
continuous increasing function r(t), for any e > 0, there exists constant 
y > 1 such that 


P. (su |Xs |] = yrv) <6. 
s<t 


Let X a denote the process with truncated kernel Jp. It follows from 
Meyer’s construction that (see [12] and [2, Lemma 3.1]) 


P.(X, Xf for some s < t) < tdp. 
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For any r > 0, y > 1, both to be specified later, we have 


P. (sw Xs || = yr) 
s<t 

< Pe (sep 

s<t 


< Pe (sep 


S<t 


xs 


> yr) +P, (x # xe for some s < t) 


xs 


> yr) + tdR 


< 2sup;Pe (|x 


s<t 


> yr/2) vs (15.11) 
The inequality 


P,. (sep 
s<t 


is a version of the André reflection principle obtained by considering the 
stopping time 7; = inf{s : XFI > yr) and using the (strong) Markov 
property and the triangle inequality to write 


P. (su 
s<t 


xe 


> yr) < 2sup {P. (|x 


s<t 


= yr/2)| 


xs 


=) sPo(| 3] = 722) 


+P. (sup |xF| >yr and |x| < yr/2) 


s<t 
R 
er, ([x#] 2707) 
+Ee (117, <P xe (|Xt_7, ll ze yr/2)) 


< 2sup;Pe (|x? > yr/2)}. 


s<t 


The last inequality uses group invariance. This will be helpful in deriving 
the following result. 

In what follows, we say that a positive increasing function V is 
doubling if there exists a constant Cyp such that, for all p > 0, 
V(2p) < Cyp V(p). 


Proposition 2.3. Assume that the volume function V is doubling. 
Assume also that v is such that (15.9) holds where m is regularly varying 
of negative index. For e > 0, fiz a function R(t) such that 


t est 


PURE 7 CON. se eamrccmny = © 
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Let r(t) > R(t) be a positive continuous increasing function such that 


sup { m(t) Viren" RO] < 00, 
t>0 
Then, for any € > 0 there exists a constant y > 1 such that 
Pe (sup | Xsll = yrv) <&. 
s<t 


In particular, we have 
l-e 
e,e) > —————.. 
V(yr(t)) 
If, in addition V(r(t)) & m(t), then the measure v is (|| - ||, 7) controlled 
an continuous time. 


pet, 


Proof. Proposition 2.1 implies that for s < t, 


¢  \tal/3R 
pR(S, €, 2) < Cm(s) (os) 


5\lzI|/3R t x|/3R 


Fix R = R(t), r = r(t) = R, decompose {z : ||z|| => br} into dyadic 
annuli {x : ||z|| © 2’yr} and write 


yr 
Pe ([x"] = 5) 
< CY ms) Cau 2 TBR Y Qty p(t) 
i=0 
7  m(s) (8\2 77/3 _oi-ty rar ( VQtyr) 
= OOD 2. mb) (-) e€ y (ee ) 


Let Cyp denotes the volume-doubling constant of (G, d), then 
4 
VQ'yr) < hp. 
Viyr) 


Recall that m(t) is a regularly varying function with negative index. 
Hence, for y large enough, we have 


aly /3 
M= __ sup — (=) | <0Oo 
O<s<t,icN m(t) \t 


Viyr) < Cyn®’ Von), 


Therefore 


[o,@) 
P. (|x? = a) < Oym(t) V(rye"T/OR * e-27/12GF,. 


i=0 
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By assumption, r = r(t) and R = R(t) satisfy 
sup {m(e) VEE OG) eee 
t>0 


It follows that for y sufficiently large, we have 


r (|x! 


Y éE 


Plugging this estimate into (15.11), we obtain P, (sup,<z{ |Xs ll} = yr(t)) 
<€é. 


Corollary 2.4. Under the hypotheses of Proposition 2.3, for any € > 0 
there exists y > 0 such that 


Pe (sep |Xsll < 2yr(@), ||Aell S yr(0) rae ea 
Proof. Write 
Pe (sw |Xsll < 2yr(@), ||Aall S yr(o) 
= Pe (lXell < yr@) — Pe (sx |Xsl| = 2yr(2), lXell < yr(v) 
>1-2P, (sep | Xsll = yr(o) : 
s<t 


Note that we have used the fact that, because of space homogeneity (i.e., 
group invariance), X; cannot escape to infinity in finite time. 


Remark 2.5. The conclusions of Proposition 2.3 and Corollary 2.4 apply 
to the associated discrete-time random walk. To see this, fix a regularly 
varying function m and note that (up to changing m to cm for some 
constant c), (15.9) is equivalent to v?@”)(e) < m(n). Further, it is easy 
to control the difference between Pe(||X¢|| > 7) and Pe(||Xn|| = 7) with 
n = |t] as long as n is large enough. It follows that the proof above 
applies the discrete random walk result as well. 


3. Pseudo-Poincaré Inequality and Strong Control 


3.1. Pseudo-Poincaré Inequality 


With some work, the results of the previous section can be extended 
to the more general context of graphs and discrete spaces. The results 
presented below make a more significant use of the underlying group 
structure. 
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Definition 3.1. Let G be discrete-group equipped with a symmetric 
probability measure v, a subadditive function || - || and a positive 
continuous increasing function r with inverse p. We say that v satisfies 
a pointwise (|| - |], 7) pseudo-Poincaré inequality if, for any f with finite 
support on G, 


VgeG, D> If(eg) —f@)? < Co(lgNEE.A)- (15.12) 
LEG 
Here €, is the Dirichlet form of v defined at (15.8). 


Theorem 3.2. Assume that (G,||- ||) is such that V is doubling. Let 
v be a symmetric probability measure such that v(e) > 0. Assume that r 
is a positive doubling continuous increasing function such that 


veM(e)~ V(r(n))t. 


Assume further that v satisfies the (|| - ||, 7) pseudo-Poincaré inequality. 
Then there exists n > 0 such that for all n and g with |lg|| < nr(n) we 
have 


v(g) = V(r(n))y*. 
Proof. The hypothesis (15.12) and the argument of [10, Theorem 4.2] 
gives 
1/2 
[v2ntN) (zy — p2nrt+N)(e)| < o (SS) yer) (e), 
Fix x and n such that (|x|) < nn and use the above inequality with 
N = 2n to obtain 
vAnr(n) > (1 = (2c'n)"?) vane), 
Hence, we can choose 7 > 0 such that 
ven (z) > cv@ (e). 


Because v(e) > 0, this also holds for 4n + 7, 1 = 1,2,3, at the cost of 
changing the value of the positive constant c. 


3.2. Strong Control 


Definition 3.3. We say that || - || is well connected if there exists 
b € (0,00) such that, for any r > 0 and z € G with ||z|| < r there exists 
a finite sequence of points ae € G with ||x;|| < 2r, llaj ve+all <b, 
wm =eand ty = 2. 


Note that in this definition, the number N of points in the sequence 
is finite but that it depends on x and no upper bound in terms of r is 
required. 
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Lemma 3.4. Assume that || - || is well connected and V is doubling. 
Then for any fixed € > 0 there exists Me such that for any r > 8b/e and 
any ||z|| < r we can find Cas m= e, zy = 2, M < M, such that 
lz; *zeqall < er. 


Proof. Let {yj :0< 1% < M’} be a maximal er/4-separated set of points 
in B(e,2r) = {\lg|| < 2r}. The ball Bj = B(yj,er/9) are disjoints and 
have volume V(er/9) comparable to V(2r). Hence M’ < M: for some 
finite M{ independent of r. The union of the balls BY = B(yj,€r/4) 
covers B(2r) (otherwise, {y; : 0 < i < M’} would not be maximal). Fix 
xz with ||xz|| < r and let Can be a sequence of points witnessing well- 
connectedness for the point «. By construction, the balls Bj, 1 <i< M’, 
cover the sequence (as) and we can extract a sequence B* = B’, 0 < 
i < M < Mj, such that Bj 5 e, By, > 2, inff{ilgll : 9 € BY} < 4, 
inf{\|zg~"|| : g € BY,_,} < b and 


inf {14-1 gl :he BY,ge Bi} Speci. NSO) 


Set 2 =, 4% = yj,,1< i < M-1, zy = 2. Then |Z), a|| < 2er/4t+b < 
er as desired. 


Proposition 3.5. Assume that the norm ||-|| is such that V is doubling 
and || - || ts well connected. Let r be a positive continuous increasing 
doubling function. Let v be a symmetric probability measure that is 
(\|- |], 7) controlled and satisfies v(e) > 0 and a pointwise (||- ||, 7) pseudo- 
Poincaré inequality. Then v is also strongly (|| - ||,) controlled. 


Proof. First, we show that for any x > 0 there exists c, > 0 such that 
|x || < «r(n) implies 

v™ (a) > eV(r(n))?. 
By Theorem 3.2, there exists 7 such that v™(r) > cy V(r(n))7 for all 
|z|| < nr(n). By Lemma 3.4, for any fixed « there exists M, such that 
for any ||z|| < «r(n) we can find Care 2m =e, zy =z, M < M,, such 
that ||; 'zisill < nr(m)/4. Write By = {g : lz; ‘gll < nr(m)/4} and 


pila) (a) > yea Oe na) oD 
(yi, ym e@M B; 
> of Var(ny/4y4 Vey te 4 Vert. 
Given that v(e) > 0, this shows that ||z|| < «r(n) implies v™ (2) > 


cV(r(n))~! as stated. In particular, for any fixed x, there exists « > 0 
such that for any z,n with «r(n) <T and |z|| < T, 


Pz(|Xnll < t) 2 €. 
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Now, fix yi € (1,00). Let «9 > 0 be such that, for any xz, n, t with 
lz] <t < mr@n), 


we have P,(||X,|| < t) => €9. Let y => 1 be given by Definition 1.2 so 
that 


P, (sna > yr) > €9/2. 


k<n 


Set yo = y/yi +1 and, for any x,n,t with ||z|| < t and 5p (t/y1) <n< 
p(t/y1), write 


P, (stan S vat, |Xnll S ‘) 


k<n 


= Py (|Xell < t) — Px (sx |X| 2 vat, |Xnll S ) 


k<n 


>eo—P- (s | Xz] = rin] > €9/2. 


k<n 


This proves that v is strongly (|| - ||, 7) controlled. 


As a simple illustration of these techniques, consider the case of 
an arbitrary symmetric measure v with generating support and finite 
second moment (with respect to the word-length | - |) on a group with 
polynomial volume growth of degree D(G). It follows from [16] that 
v™(e) ~ n-P(@/2 and satisfies a pointwise classical pseudo-Poincaré 
inequality (with p(t) = t?). Proposition 3.5 yields the following result. 


Theorem 3.6. Let G be a finitely generated group with polynomial 
volume growth with word-length |-|. Assume that v is symmetric, satisfies 
v(e) > 0, has generating support and satisfies )~ Igl?v(g) < oo. Then v 
is strongly (|-|,t > t!/*) controlled. 


4. Measures Supported on Powers of Generators 


The goal of this section is to explain the proof of the part of Theorem 
1.9 that deals with the measures gq. 
4.1. The Measure j5,q 


In this subsection we consider the special case when G is a nilpotent 
group equipped with a generating k-tuple S = (s1,...,5,) and 


J(a,y) = bs,a(aty), @= (@1,...,a%) € (0,00)* (15.13) 


with wg,q given by (15.4). Our aim is to prove the first statement in 
Theorem 1.9. The study of the random walks driven by this class of 
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measure was initiated by the authors in [18] and we will refer to and use 
some of the main results of [18]. 

Following [18, Definition 1.3], let t be the power weight system on 
the formal commutators on the alphabet S associated with setting w; = 
1/a;, &; = min{a;,2}. Namely, the weight of any commutator c using 
the sequence of letters (s;,,..., 5;,,) from S' (or their formal inverse) is 
w(c) = 07" wi. Recall also from Section 1.3 that a* = max{a;} and 


Qs,a(r) = # {9 IIgllsa srl". 
In [18], the authors proved the following result. 


Theorem 4.1 ([18]). Referring to the above setting and notation, 
assume that the subgroup of G generated by {s; : a; < 2} is of finite 
index. Then there exists a real Dg .q = D(S,t0) such that 


Qs,alr) = rP%e, wg le) Sn PH. 

The real Dg.q = D(S,t0) is given by (18, Definition 1.7]. Further, there 
exists a k-tuple b = (B1,--- , Bk) € (0,2)" such that B; = a; if aj < 2, 
D(S, a) = D(S, 6), and 


Vg EG IlgIS , ~ lgls,- 
In addition, Wg,q satisfies a pointwise (|| - llg,a,t te t1/) nseudo- 
Poincaré inequality. 

The volume estimate Qg a(r) = Qs5,9(7) = rs, shows, in particular, 
that (G, || - lls,5) has the volume-doubling property. The upper-bound 
ws (e) < Cn7P3,0 implies that the continuous time process with jump 
kernel J defined above satisfies 

Vt>0,2EG, p(t,x,x) < m(t) = Ct?PS, 


Note that || - || is clearly well connected (Definition 3.3). To apply 
Propositions 2.3 and 3.5 to the present case and prove Theorem 1.9, it 
clearly suffices to prove the following lemma, which provides estimates 
for dp and G(R). 


Lemma 4.2. Referring to the setting and hypotheses of Theorem 4.1, 


for J given by (15.13), let || - |] = ll -lls,5, D = Ds,p = Ds,a, we have 
Vir) = #19 € G: Ig srp~ PP, 
dp RFs, 
RG acm eae 


Proof. The volume estimate follows immediately from Theorem 4.1. 
Let » be the power weight system associated with b (in particular, 
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uj = 1/6; > 1/2). By [18, Proposition 2.17], for each i there exists 
0 < B. < Bi < Bx < 2 such that 


[si = InP. 


In the notation of [18], B; = 0j,(s;).-We have 


=> HS,a(Z) = > os aes 


Ic|>R i=l ||s?|>R 


~ > s ai = TESTE ~ os R7 Bx oti / Bi, ~ RoPs, 


1=1 1. phx/B; i=1 
The last estimate uses the fact that there must be some 7 € {1,...,k} 
such that a; = £/ and that, always, a; > Bi. 
Similarly, since a; > Bi and B, < 2, we have 26'/B, — aj > 0. This 
yields 


il 


G(R) = Yo all ey DS ile 


IzIl<k t=1 ||s?||<R 
K;|n|2i/B« 


k k 
~ ~ 2—B.0i/B; ~ R2-Bx 
ae 2 (1 + [nfyites =a es 


t= 1 gen < phe/6; 


This proves Lemma 4.2. 


4.2. Some Regular Variation Variants of ws ¢ 


Consider the class of measure wz of the form 


wg) = ty dX xia (15.14) 


ile (1+ [mpi 


where each £; is a positive slowly varying function satisfying ¢;(¢°) ~ 
£;(t) for all 6 > 0 and a; € (0,2). For each i, let F; be the inverse 
function of rh r%/£;(r). Note that F; is regularly varying of order 
1/oj; and that F,(r) ~ [ré;(r)|!/%, r > 1, i = 1,...,k. We make the 
fundamental assumption that the functions F; have the property that for 
any 1 < 1,7 <k, either Fj(r) < CF;(r) of Fj(r) < CFj(r). For instance, 
this is clearly the case if all w; are distinct. Set a = (@1,...,a@%) € (0, 2k 
and consider also the power weight system v generated by vj = 1/aj, 
1<i<k, as in (18, Definition 1.3]. Fix ag € (0,2) such that 


ao > max{a;:1<i< k} 
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and ag/a; ¢ N, i = 1,...,k. Observe that there are convex functions 
Kk, > 0,1=0,...,k, such that K;(0) = 0 and 


Vr >1, Fy(r%) ~ Kj(r). (15.15) 


Indeed, r  Fi(r%) is regularly varying of index ag/a; with 1 < 

ao/a; ¢ N. By [4, Theorems 1.8.2—1.8.3] there are smooth positive convex 

functions K; such that Ki(r) ~ Fy(r®). If K;(0) > 0, it is easy to 

construct a convex function Kj : [0,00) > [0,0o) such that K; ~ Kj; on 

[1,00) and K;(0) = 0. Let us use & to denote the collection (K;j)7. 
Now, set 


m 
gl =llgla=min}r:g= |] si :6=41, Ai: =3 < Ki(r) 


Because of the convexity property of the Kj, || - || is a norm. Note also 
that it is well connected. The following theorem is proved in [18]. 


Theorem 4.3. Referring to the above notation and hypothesis, there 
exists a real D = Dg q = D(S,0) and a positive slowly varying function 
L (explicitly given in [18, Theorem 5.15] and which satisfies L(t®) ~ L(t) 
for all a > 0) such that: 


e For allr >1, V(r) = #19: Iigll <r} & rP Lr). 

e Foraeachl <i<k, | ee exists a regularly varying function F; 
such that ||s;*||%° < CF. lin) where F; > F; and with equality for 
somel<i<r. 

e For alln > 1, we (e) < Cn? L(n))7I. 

e The measure satisfies a pointwise (|| - ||,t te t!/"°) pseudo- 
Poincaré inequality. 


Here, we prove the following result. 


Theorem 4.4. Let G be a finitely generated nilpotent group equipped 
with a generating k-tuple (s1,..., 5%). Assume that ww is a probability 
measure on G of the form (15.14). Let ¢;, Fi, L, D = Dg.a, a0 € (0,2) 
and || - || be as described above. Then jw is strongly (|| - ||, t  t!/%) 
controlled. 


Proof. It suffices to estimate the quantities 6p and G(R) in the present 
context. For 6p, we have 


1 s 1 34 
1 4 1 


ss OL 
n>Fi( RYO) n. (n) 5; F(R) 
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A similar computation gives 
k R2 


ey ~ PR2-a0 


4.3. The Critical Case When aj;_2, 1<ix<k 


When a = 2 = (2,...,2), that is, a; = 2 for all 1 < 7 < k, we work with 
i usual word- length function |g| associated with the generating set 
= {5 Eu SR Ay In this case, V(r) = #{g: |g| <r} = rP(G) where 
Bie ) is the dlacacal degree of polynomial growth for the nilpotent group 
G. It is proved in [18] that uga(e) < < C(nlogn)~?/? and that HS,2 
satisfies a pointwise (| -|,t hb ine t)!/2) pseudo-Poincaré inequality. 
Further, |s”| ~ |n|//® with 6; => 1 and B; = 1 for some i. From this it 
easily follows that 


RX R-?, G(R) ~ log R. 


Applying Proposition 3.5 with r(t) = (tlogt)!/? yields the following 
theorem. 


Theorem 4.5. Let G be a finitely generated nilpotent group equipped 
with a generating k-tuple (s1,...,5,). Let D(G) be the volume growth 
degree of G. Then wg, is strongly (|-|,tt> (tlog t)'/?) controlled. 


5. Norm-Radial Measures 


In this section we assume that G is a finitely generated group with 
polynomial volume growth of degree D(G) and we consider norm-radial 
symmetric probability measures. 


5.1. Radial Measures with Stable-like Exponent «a é€ (0, 2) 


This subsection treats probability measures of the form 


1 
we) , I(t, y) X vex" y), 15.16 
ONE Tr TepeVGED? ee eon) 


where a € (0, 2], ||-|| is a norm on G and V(r) = #{g : |lg|| < r}. The 
case when a € (0,2) and V(r) ~ r@ for some d is treated in [1, 2, 14] 
where global matching upper and lower bounds are obtained. We note 
that [1, 2, 14] are set in more general contexts where the group structure 
plays no role. We start with the following easy observation. 


Lemma 5.1. Referring the situation described above, assume that V 
satisfies V(2r) < Cyp V(r) for allr > 0. Then dp =~ R™ and 


R2-" if € (0,2) 


oh) = logR ifa=2. 
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Proof. This follows by inspection. 


The next lemma follows by application of Proposition 2.3. However, 
in this lemma, we make a significant hypothesis on vf (e). 


Lemma 5.2. Set ry(t) = t!/* if a € (0,2), m(t) = (tlogt)!/. 
Referring the situation described above, assume that V is regularly 
varying of positive index and 


vi (e) < CV (ra(n))*. (15.17) 
Then vy ts (|| - ||, Te) controlled. 


The next theorem provides a basic class of examples when the 
hypothesis (15.17) regarding vg can indeed be verified. Note that the 
result is restricted to the case a € (0, 2). 


Theorem 5.3. Referring the situation described above, assume that V 
is regularly varying of positive index and a € (0,2). Then 


ve) ~ V(ni/*)-1 
and Vy is (|| - ||, tt t!/%) controlled. 


Proof. It suffices to prove the upper bound vf (e) = OV (nll), 
Start by checking that 


oe) 


af 1 1B(m)(2) 
Vo(x) & dX (ame m)lte V(m) 


where B(m) = {x € G: ||z|| < m}. Then apply the elementary technique 
of [3, Section 4.2] to derive the desired upper bound on vf (e). 


Remark 5.4. In the context of Theorem 5.3, we do not know if || - || 
is well connected and we also do not know if vg satisfies a pointwise 
({|- |], 7) pseudo-Poincaré inequality. Hence, the techniques used here do 
not suffice to obtain strong control. However, if || - || is well connected 
and vy satisfies a pointwise (|| - ||, 7) pseudo-Poincaré inequality then 
the strong (|| - ||, 72) control follows by Proposition 3.5. This proves the 
second statement in Theorem 1.9 because, referring to the notation of 
Theorem 1.9, the norm || - ||3,¢ is well connected and Theorem 2.10, 
Lemma 4.4, and Theorem 5.7 in [18] show that the measure vgq,g of 
Theorem 1.9 with B € (0,2) satisfies the pointwise (|| - l|g,a,t  t'/) 
pseudo-Poincaré inequality. 


5.2. Word-Length Radial Measures 


As noticed above, the study of vg in the case a = 2 is significantly more 
difficult than in the case aw € (0,2). In fact, we do not know how to 
treat this case in the generality described in the previous subsection. 
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The following theorem treats the case when ||- || is the usual word-length 
function || - || =|-| on G. 


Theorem 5.5. Assume that G is a group of polynomial volume growth 
equipped with generating k-tuple S = (s1,..., 8%) and the associated word 
length |-| and volume function V. Let D(G) be the degree of polynomial 
volume growth of G. Let vg be a symmetric probability measure such that 


vo(g) ~ (1+ Ig)? Vg): 
Then we have 
vs (e) & (nlog n)~PO/?, 


Further, v2 is strongly (|-|, t+ (tlog t)!/?) controlled. 


Proof. We apply Lemma 5.2 and Proposition 3.5. When G is nilpotent, 
the upper bound vs” (e) < (nlog n)~?(@/2 follows from Theorems 4.8 
and 5.7 of [18]. Namely, [18, Theorem 5.7] shows that 


K 
vy” (e) < Cugs (©) 


and [18, Theorem 4.8] gives wn(e) < C(nlog n)~ 2/2, Further, [18, 
Lemma 4.4 and Theorem 5.7] shows that v2 satisfies a pointwise (|-|, ft 
(t log t)!/2) pseudo-Poincaré inequality. 

Since any group of polynomial volume growth of degree D(G) contains 
a nilpotent subgroup of finite index (hence, with the same degree of poly- 
nomial volume growth) the upper bound vs (e) < C(nlog n)~P(G)/2 
follows from the comparison theorem [16, Theorem 2.3]. By direct 
inspection, the desired pseudo-Poincaré inequality also follows. 


Note that Theorem 1.7 includes the result stated in Theorem 5.5 as a 
special case and provides a very satisfactory result covering the behaviors 
of word-length radial measures across the second moment threshold. 


Proof of Theorem 1.7. The same technique of proof as for Theorem 
5.5 gives the much more complete and subtle result stated in the 
introduction as Theorem 1.7. Namely, let ¢@ : [0,co) > [1,00) be a 
continuous, regularly increasing function of index 2 and let vg be as 
in (15.3), that is, assume that vg is symmetric and satisfies v(g) ~ 
(gl) Vg)|~*. First, assume that G is nilpotent and let Hs,¢ be the 
measure given by 


k 
K 


1 
= 1,.7(q). 
H8,6(9) = 7 dX (+ Indo (9) 


Let r be the inverse function of t t> ig, AGE By [18, Lemma 4.4], the 
measure jg, satisfies the pointwise (| - |, 7) pseudo-Poincaré inequality. 
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By [18, Theorem 4.1], it follows that 1g, (€) < Vir(n))7!. By [18, 
Theorem 5.7], we have the Dirichlet form comparison Eugg < C&vg- 
Now, if G has polynomial volume growth then it contains a nilpotent 
subgroup with finite index, Go. By inspection, quasi-isometry, and 
comparison of Dirichlet forms (see [16]), it is easy to transfer both the 
pointwise (| - |, 7) pseudo-Poincaré inequality and the decay vp (e) < 
V(r(n))7! from Go to G. Further, one checks that the functions dp and 


G(R) satisfy 6p ~ 1/¢(R) and G(R) x a ir Proposition 2.3 with 


r(t) = R(t) equals to the inverse function of s te ce as ah shows 
that vg is (|-|,7) controlled. By Proposition 3.5, vg is strongly (| - |, 7) 
controlled. 


5.3. Assorted Further Applications 


The approach presented here is applicable even in cases where we are not 
able to obtain sharp results, and we illustrate this by an example. Let G 
be a nilpotent group equipped with a generating k-tuple S = (s1,..., Sx). 
Fix a e€ (0, g\* and set a, = max{a;,1 < i < k} < 2. Consider the norm 
Il - Ils,a defined at (15.5). Let vy» be any symmetric probability measure 
such that 


1 
(1+ Ilglls,a)* V(llgils,a)’ 


Theorems 3.2, 4.8, and 5.7 of [18] give the following information. There 
exists two reals D = Dg, and d = dg,q and a constant C1 € (0,00) 
such that 


Vs(g) ~ Vir) = #{9 = Ilglls,a <7}. 


Ww (e) < Cyn-*P/2 (log ny 4 (15.18) 
V(r) & r®?. (15.19) 


Theorem 5.6. For the probability measure v, on a finitely generated 
nilpotent group as described above, we have 


c(log log n) %*P (n log n) %P/2 < vi (e) < Cn-%?/? dog n)4, 


Proof. The volume estimate (15.19) and Lemma 5.1 gives 5g ~ R~? 
and G(R) ~ log R. In order to apply Proposition 2.3, we set R(t) ~ 
(tlog t)'/?. Further, we use (15.18)-(15.19) to verify that the choice 
r(t) = 6AR(t) loglogt with A large enough satisfies the condition of 
Proposition 2.3. Indeed, we have m(t) ~ t—-%D/2 (log t)-4, V(r) & r%P 
so that 


m(t) V(r(t)e 7 O/6RO < C (log t)—2+%P/? og log t)%P e~Aloglogt 


Clearly, for A large enough, the right-hand side is bounded above by 
a constant as required by Proposition 2.3, which now gives the stated 
lower bound on vf (e). 
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5.4. Complementary Off-Diagonal Upper Bounds 


In contrast with the case (15.4) of measures supported on powers of 
generators, for norm-radial kernels of type (15.16), we can use Meyer’s 
construction to derive good off-diagonal bounds for p(¢, e, x). 


Proposition 5.7. Let G be a finitely generated group equipped with a 
norm ||- ||. For a € (0,2), let vg be a symmetric probability measure on 
G satisfying (15.16). Assume that there exists a positive slowly varying 
function £ and a real D > 0 such that: 


LVr>1, Vinx rer); 
2.Vt>0,2€G, plt,z,z) < mt) ~ [t+ H?/ee'/2y)/-1. 


Then there exists C' such that, for allt > 1 and x € G, we have 


1+D/a €(t1/2) ‘ 
) (ie ll)’ 


p(t, e,z) < Cm(t) min | ( 


I|||% 


Remark 5.8. This proposition is stated here mostly for comparison with 
the next proposition. In fact, for the measure veg with a € (0,2), the 
hypothesis (1) implies automatically that (2) is satisfied as well. See 
[1, 2, 11, 14]. See [14] for a complete study of this case including two- 
sided discrete-time estimates. 


Proposition 5.9. Let G be a finitely generated group equipped with a 
norm ||- || with volume V. Let vo be a symmetric probability measure on 
G satisfying (15.16) with a = 2. Assume that: 


()Vr>1,V(r)xr?, 
(2) Vt>1, £€G, plt,2,2) < m(t) x (tlog t)-?/?. 


Then there exists C' such that, for allt > 1 and x € G, we have 


tl 1+D/2 
p(t, e,xz) < Cm(t) min | (Aa) sail 


lel? 


Further, for any y € (0,2), there exists Cy such that if 1 < t < ||a||” 


then 
Cy 4 \1+D/2 
ven 
mee 03 (za) 


Proof of Proposition 5.7. Under the stated hypothesis, we have 5p ~ 
R-® and G(R) ~ R?-“ and, for 1 < t < nR® (with n to be fixed later, 
small enough), Proposition 2.1 gives 


¢ \llell/3R 
PRC, é, L) = Cm(t) (=) 
4 6 
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By Meyer’s construction (see the notation introduced after (15.7)), we 
have 


p(t,2,y) < pr(t,z,y) +t | val|, 


1 t \l#ll/3R t 
a1 tD/oe, (¢1/2) (=) + Re+D/«)¢4(R)’ 


Choose R = R(a, t) such that the two terms of the sum on the left-hand 
side are essentially equal, namely, set 


je) el eon BO) (te = pais 
Pp ae = rod a Stay’ 


As long as 7 is small enough, this choice of R gives ||xz|| ~ R and 
Qt 1 t \HP/@ g, (4/2) 
p(t, zy) = a(1+D/a) = D/a 1/a a ; 
Il Ai(Itll) b(t) \ Mall (lel) 


For any ¢ (in particular, t > 1 R®) we can also use m(¢) for an easy upper 
bound. This gives 


t 14+D/a £,(t1/2) 
t Cm(t) min ) | —Z 
p(t, e,) < nomin| (ia) a (laid 


or, equivalently, 


p(t, e,t) < Cmin {tvg(|z]]), m(} . 


Proof of Proposition 5.9. In the context of Proposition 5.9, we have 
dp ~ R? and G(R) ~ logR. For 1 < t < nR?, n > 0 small enough, 
Proposition 2.1 and Meyer’s decomposition give 


p(t,t,y) < pr(t,z,y) +t l-pel, 


= tlog R\"#l/38 t 
< C(tlog t)-?/? (4S) + Foe" 


If R?/log R < t < R? then this bound is not better than the easy bound 
p(t, x,y) < m(t). By taking R such that ||z|| = 3RG + D/2), we obtain 


_ { (tlog |x| \t?/? 
p(t, z, y) < Cm(t) min aa) 7", mae 


llxll? 


However, if 1 < t < n|la||” with y € (0,2) and 7 small enough, then we 
can choose R ~ ||z|| so that 


_pa(tlogR\iW3® 4 
(t log t) —R = RUD? 


On SOME RANDOM WALKS DRIVEN BY SPREAD-OUT MEASURES 469 


equivalently, 
14+D/2 


R2 [z|/3R 7 R2 (log R)it+D/2 
tlog R tlog R (log t)P/2 © 


In the region t < ||z||”, this yields 


1+D/2 
2t hits 
deat eae 
|a\|2*? l|z|I? 


6. Random Walks on Wreath Products 


In this subsection, we illustrate how to use Proposition 1.4 to derive a 
lower bound for return probability of certain classes of random walks on 
wreath products. 

First, we briefly review the definition of wreath products and a special 
type of random walks on them. Our notation follows [15]. Let H, K be 
two finitely generated groups. Denote the identity element of K by ex 
and identity element of H by ey Let Ky denote the direct sum: 


Kya © Ki 
heH 


The elements of Ky are functions f: H > K, hw f(h) = ka, which 
have finite support in the sense that {h € H : f(h) = kp, # ex} is 
finite. Multiplication on Ky is simply coordinate-wise multiplication. 
The identity element of Ky is the constant function ex : hw eK 
which, abusing notation, we denote by ex. The group H acts on Ky by 
translation: 


thf(h') =f(h th), hy hl € H. 
The wreath product K 2 H is defined to be the semidirect product 
KiH=ky™x, dA, 


(fF A)(f'W) = (ftnf! bh’). 
In the lamplighter interpretation of wreath products, H corresponds to 
the base on which the lamplighter lives and K corresponds to the lamp. 
We embed K and # naturally in KH via the injective homomorphisms 
kt> k= (key, eH), key(ex) =k, key (h) = ex if h A ex 
htr> h=(ex,)h). 
Let uw and n be probability measures on H and K respectively. Through 


the embedding, and 7 can be viewed as probability measures on K2H. 
Consider the measure 


q=n* xn 
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on KH. This is called the switch-walk-switch measure on K 2 H with 
switch-measure 7 and walk-measure ju. 

Let (X;) be the random walk on H driven by wu, and let I(n, h) denote 
the number of visits to h in the first n steps: 


in, hy =#{i:0<i<n, Xj; =h}. 


Set also 
I(n, h) if h ¢ fez, g} 
iZ(n,h) =4 In,ey)—1/2 ifh=g 
I(n,ey)—1 if h = ey. 


From [15], probability that the random walk on K 2? H driven by g is 
at (h,g) € K24H at time n is given by 


gd (f,9) =E ( T] 2? G00) 


heH 
Note that E stands for expectation with respect to the random walk 
(Xi)g° on H started at ey. 
From now on we assume that 7 satisfies n(ex) = € > 0 so that 


en®Y(ex) < nex) < eg (ex). 


C : 
Write f x g if C~!f < g < Cf. Under these circumstances, we have 


1/e3 
a™(eK,9) & °(11 1H) 


heH 


so that we can essentially ignore the difference between / and |,. 
Set 


F(n) := —logn®” (ex) 
so that, for any g € H, 


g™((ex,9)) = E (ee Lu PEO a) (15.20) 


Proposition 6.1. Let H be a finitely generated group equipped with 
a symmetric measure w with u(ey) > 0. Let K be a finitely generated 
group equipped with a symmetric measure n with n(ex) > 0. Let ||-|| be a 
norm with volume function V. Let r be a positive continuous increasing 
function. Assume that: 


(1) The measure c is strongly (|| - \l,7) controlled and V_ satisfies 
VQae. 

(2) The function r satisfies r(t) = t'/Fe,(t) where 1 is a positive 
continuous slowly varying function. 

(3) The function Fx (n) = —logn?@™ (ex) ~ n” €2(n) where y € [0, 1) 
and €2 is a positive, continuous, slowly varying function. 
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Assume also that the slowly varying functions €;, i = 1,2, are such 
that €;(t%) ~ €;(t) for alla > 0. Then the switch-walk-switch measure q 
on K2.4i associated with the pair n, wu satisfies 


( D(i~y)+yB Da- B 
g (©) = exp (—Cn DEVE ¢, (n) BOVE b(n) PTF 
Proof. Let m be the spectral measure of n) in the sense that 
/ t™dm(t) = v2” (0). 
[0,1] 
For x € [0, 00), set 


F(a) := tog [ t” dm(t). 
[0,1] 
Observe that Fx(n) = F(n) and that F is a concave function. For 
t > 0, let B(t) ={h eH: ||h|| < Tt}. Since q2@™(e) > gq?” (x) for any 
xz € KH, (15.20) yields 


q'™(e) = Zi : 


E (e- Dn Fr(ln,h))4 
B(t) 


t1Xal=n) 


= y Fe (Un,h)) 
= THETA Bee Te Umax cent Xel St) 


Using the concavity of F and the confinement of the walk (X,) on A in 
the ball B(t) in the last expression, this yields 


1 
g™(e) => —~e VOF@/VO)P, ( max {|| Xgll} <7). 
V(t) l<k<n 


Let tr be such that 
V (tn) F(n/ V (tn) = n/p (tr) 
where p is the inverse of r. By our various assumption, this means 
(tp /n)'Yla(n/ty) = ty Pl (tn)?. 


Hence n —> Tr, is a regularly varying function of order (1 — y)/(6 + D 
(1-—y)) < 1/8. This shows that r(n) > Ty and, since wu is strongly 
(| - |], 7) controlled, Proposition 1.4 yields 


g™ (e) > te Veen) Fln/ Ven) g—Gn/plEn)  g-Cin/plEn) 
V(tn) 


and 


n DQ~y)+yB B 
= n DU-y)+B £4 (n) bas ee £9(n) PG-v)8 , 
P(Tn) 


This gives the stated lower bound on q‘(e). 
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Remark 6.2. The case y = 1 is excluded from these computations. It 
can be treated by the same method, but t, becomes a slowly varying 
function of n. 


Remark 6.3. In the setting of Proposition 6.1, suppose in addition we 
have H = Z% and w is in the domain of attraction of an operator- 
stable law v on R¢, that is there exists a normalizing sequence By, € 
GLq(R) such that By Tu*” = y, Then the lower bound in Proposition 
6.1 is sharp and agrees with [17, Theorem 4.2]. Note that in this case, 
det Bn ~ V(r(n)), the scaling relation in [17, Theorem 4.2] reads 


det B 
an det Ba,, ie n S4) 
n det Ba, 


and it agrees with 
V(t) F(n/ Vn) = 2/P (Tn), 
with an XY p(T). 


Example 6.4. Consider the symmetric probability measure yz on Z of 
the form 


L 
u(n) = s Kei ([n)) 


l+a 
2 TF in) 


where a € (0,2) and £1 is a positive, continuous, slowly varying function 
satisfying €1(t°) ~ €1(t) for all b > 0. We have 

; Kl\(R) 2 K 2-0 
a= Deu Beg 2s ODE cg "SIN 


Therefore R?5p/G(R) > (2 —«)/a. By a classical result (see [9]), ju is 
in the domain of attraction of an a-stable law on R. The normalizing 
sequence by, can be chosen as the solution to the equation nb; 2G(bn) = 1, 


that is by, ~ (s&neim) Let K be a finitely generated group 
equipped with a symmetric measure n with n(ex) > 0 . Suppose that 
the function Fx(n) = —logn?™ (ex) ~ n”€2(n) where y € [0,1) and 
£9 is a positive continuous slowly varying function. Assume also that 
lo(t%) ~ 9(t) for all a > 0. Then [17, Theorem 4.2] (and the remark 
following that statement in [17]) implies that the switch-walk-switch 
measure g on K 2 A associated with the pair 7, ju satisfies 


(n) (=y)+yp (l-y) B 
—log q™ (€) = n/ dn = 2 CF £3 (n) OV b(n) TIF , 


where dy is computed from the scaling relation 


b 
Gn ban By (+) aan 
n ban 
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This agrees with the lower bound in Proposition 6.1. Note that by 
Proposition 3.5, the measure p is strongly (|-|,7) controlled where 
r(n) = (néy(n)) 
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Abstract 

In July 2012 the General Assembly of the United Nations resolved 
that 2014 should be the International Year of Crystallography, 100 
years since the award of the Nobel Prize for the discovery of X-ray 
diffraction by crystals. On this special occasion, we address several topics 
in mathematical crystallography. Especially motivated by the recent 
development in systematic design of crystal structures by both math- 
ematicians and crystallographers, we discuss interesting relationships 
among seemingly irrelevant subjects; say, standard crystal models, tight 
frames in the Euclidean space, rational points on Grassmannian, and 
quadratic Diophantine equations. Thus, our view is quite a bit different 
from the traditional one in mathematical crystallography. 

The central object in this article is what we call crystallographic 
tight frames, which are, in a loose sense, considered a generalization 
of root systems. We shall also pass a remark on the connections with 
tropical geometry, a relatively new area in mathematics, specifically with 
combinatorial analogues of the Abel-Jacobi map and Abel’s theorem. 
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1. Introduction 


It is my pastime to make various models of crystals by juggling a kit 
that I bought at a downtown stationer’s shop. Though it is not always 
possible to make what I want because of the limited usage of the kit, I can 
still enjoy playing with it. For instance, my kit allows me to produce the 
model of the diamond crystal whose beauty, caused by its big symmetry, 
has intrigued me for some time and motivated me to look for other crystal 
structures, if any, with the similar symmetric property as the diamond. 
Actually as shown in [35] there exists the only structure that deserves 
to be called the diamond twin! (Fig. 16.1). 

In the eyes of mathematicians, a crystal model as a network in space is 
simply a (piecewise linear) realization of an infinite-fold abelian covering 
graph over a finite graph. The key in this observation is that the 
translational action of a lattice group leaving the crystal model invariant 
yields a finite quotient graph,” and that the canonical map onto the 
quotient graph is a covering map whose covering transformation group 
is just the lattice group. This simple fact leads to the definition of 
topological crystals of arbitrary dimension, and can be effectively used to 
enumerate all topological types of crystal structures because an abelian 
covering graph over a finite graph Xo corresponds to a subgroup of the 
first homology group H1(Xo, Z); thereby the enumeration being reduced 
to counting finite graphs and subgroups of their first homology groups 
[37], [38]. Needless to say, however, there are infinitely many ways to 


Figure 16.1 Diamond and its twin. From WebElements 
http://www.webelements.com. 


HB 


This is what I call the Ky, crystal [35], [38] because of the fact that it is, as a 

graph, the maximal abelian covering graph over the complete graph Ky4 consisting 
of four vertices. The structure was for the first time described by Fritz Laves in 
1933. Diamond and its twin are characterized by the “strong isotropic property,” 
the strongest one among all possible meanings of isotropy. 
Refer to [9] for this observation made in the community of crystallographers. The 
vector method mentioned in this reference can be interpreted in terms of cohomology 
of the quotient graph (Sect. 7). Historically A. F. Wells is the crystallographer who 
initiated a systematic study of crystal structures as 3D networks [42], [43]. See [32] 
for some recent views on mathematical crystallography. 


iw) 
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realize the covering graph. Thus, it is natural to seek a “standard model” 
having as many symmetries as possible just like diamond and its twin. 

Standard realizations introduced in 2000 by M. Kotani and myself [25], 
[27] in connection with asymptotic behaviors of random walks may be 
called standard models. Indeed the standard realization of a topological 
crystal X has maximal symmetry in the sense that every automorphism 
of X extends to a congruent transformation leaving the realization 
invariant. Moreover, crystal models with “big” symmetry turn out to be 
the ones obtained by standard realizations (see Theorem 7.7 in Sect. 7 for 
the precise formulation). Figure 16.2 illustrates several three-dimensional 
examples.® Classical two-dimensional lattices such as the square lattice, 
the regular triangular lattice, the honeycomb, and the kagome lattice are 
also standard realizations. An interesting feature following the tradition 
of geometry is that standard realizations are characterized by a certain 
minimal principle, just like the characterization of the round circle by 
means of the isoperimetric inequality. Furthermore, this notion combined 
with the enumeration of topological crystals provides a useful method 
for a systematic design of crystal structures. Actually, there is a simple 
algorithm for the design with which one can create a computer program 
to produce the CG images of two- or three-dimensional crystals.4 

Among all standard models, the simplest one is the cubic lattice (the 
jungle gym-like figure in plain language). As a matter of fact, the cubic 
lattice is not very interesting as a crystal model,° but from some “view,” 
this lends itself to another recreation and gives rise to an interesting 
mathematical issue that is linked to the standard models of general 
crystal structures mentioned above. 

Let us look at the cubic lattice from a sufficiently remote distance. 
What we find when we turn it around is that there are some specific 
directions toward which we may see two-dimensional crystalline patterns 
(ignoring the effect of perspective). For instance, one can see the square 
lattice and the regular triangular lattice as such crystalline patterns. 

Mathematically, we are looking at the image in the cartesian plane 
R? of the cubic lattice placed in R® by the orthogonal projection P : 
(x,y,z) > (x,y) (see Figure 16.3). Here the cubic lattice is supposed 
to be generated by an orthonormal basis f;, fo, £3 of R?. Thus, the set of 
vertices in it is 


3 Lonsdaleite (named in honor of Kathleen Lonsdale) in this figure is thought of as 
a relative of diamond, but is not isotropic. 

4 Due to Hisashi Naito. Crystallographers also sought standard models; see [14], 
[16], [20] for instance. Some of their models are the same as ours. The algorithm 
SYSTRE created by Delgado—Friedrich in 2004 produces the barycentric drawing, 
which seems to coincide with standard realizations as far as several examples are 
examined. See also [29]. 

5 Sodium chloride (NaCl) crystallizes in a cubic lattice. 


TOSHIKAZU SUNADA 478 


Figure 16.2 (a) Lonsdaleite (from WebElements http://www.webelements.com), 
(b) ThSig structure, (c) three-dimensional kagome lattice, (d) net associated 
with the face-centered cubic lattice, (e) net associated with the body-centered 
cubic lattice 


Figure 16.3 Projected images of the cubic lattice 


{ky fy + kofe + k3f3| ky, ko, kg © Z}. 
We now put v; = (a;, 6;) = P(f;). Because 
(x, fi) fi + (x, fo) f + (x, f3)e3 =x (x € R®), (16.1) 


and (x, f;) = (x,v;), Px =x for x € R?, we have, by projecting down the 
equality (16.1) to the z-y plane, 


(x, vi)v1 + (x, v2) vo + (x, v3)v3 =x (xe R?). (16.2) 
The projected image of vertices in the cubic lattice is given by 


{kivy + kava + kava k1, ko, k3 € Z}. 
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Figure 16.4 


What we need to notice here is that the projected image does not always 
give a crystalline pattern. For instance, Figure 16.4 depicts evenly spaced 
parallel lines expressing the closure of the projected image of vertices in 
the case v1 = (1,0), v2 = #(0,1), v3 = t(1, /2), where we should note 
that {m+nV/2| m,n € Z} is dense in R (more generally, given a positive 
irrational number a, one can find infinitely many positive integers p, q 
such that |a — q/p| < 1/p? (Dirichlet’s theorem), from which it follows 
that {m+ na| m,n € Z} is dense). 

Actually the image of vertices in question is dense in almost all cases. 
To have a crystalline pattern, it is necessary (and sufficient) that three 
vectors v1, V2, V3 generate a lattice in R?, or equivalently there exists a 
triple of integers (n1, 2,3) 4 (0,0,0) such that 


nv + neve + n3gv3 = 0, (16.3) 


where one may assume without loss of generality that the greatest 
common divisor of n1, 2,3 is 1. Then the kernel of the homomorphism 
p : Z° —> R? defined by p(k1, ko, k3) = kivitkeve+k3v3 coincides with 
A := Z(n1, n2, n3). Going back to Figure 16.3, we observe that the square 
lattice (a), regular triangular lattice (b) and the lattice (c) correspond 
to (no, m1, 2) = (1,0,0), (mo, m, m2) = (11,1), (no, m1, n2) = 1,1, 2), 
respectively. 

This expository article, including a few new results, is thought of 
as a continuation of my book [38] published in 2012. The purpose is, 
starting from the above elementary observations, to share a link to a few 
mathematical subjects, say tight frames in the Euclidean spaces, rational 
points on Grassmannians, and quadratic Diophantine equations. Those 
subjects are not things of novelty (for instance, tight frames appear in 
various guises in practical sciences), but turn out to be closely connected 
with each other in an interesting way. 

The protagonist is crystallographic tight frames introduced in Sect. 3. 
This notion generalizes the above-mentioned situation, and is closely 
related to a systematic design of crystal structures through the notion 
of standard realizations (Sect. 7). Further, this in a special case is 
regarded as a generalization of root systems whose origin is in the work of 
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Figure 16.5 Examples of two-dimensional Coxeter complexes 


W. Killings, E. Cartan, and H. Weyl on Lie groups. Actually, irreducible 
root systems yield highly symmetrical crystallographic tight frames. It 
should be pointed out that root systems pertain to Euclidean Coxeter 
complexes (cf.[7]), very remarkable triangulations of the Euclidean space, 
named after H. S. M. Coxeter (see Figure 16.5 for two-dimensional 
examples that correspond to the root systems Bz and G2). A remarkable 
fact is that the one-skeleton of a Coxeter complex is the standard 
realization of a crystal structure (Sect. 7). 

As is well known, root systems are completely classified by means of 
Dynkin diagrams. On the other hand, as will be described in Sect. 4, 
similarity classes of crystallographic tight frames are parameterized 
by rational points on Grassmannians (this is by no means surprising 
if we rephrase the above observation as “the projection of the cubic 
lattice towards a rational direction gives rise to a crystal pattern”). 
Certain quadratic Diophantine equations show up when we explicitly 
associate crystallographic tight frames with rational points. A note- 
worthy situation occurs in the two-dimensional case especially; we may 
parameterize the (oriented) congruence classes by “rational points” on 
a certain complex projective quadric. A rational point we mean here 
is a point in a complex projective space each of whose homogeneous 
coordinate is represented by a number in an imaginary quadratic field. 
In Sect.6, we explain a relationship with tropical geometry, skeletonized 
version of algebraic geometry, especially with discrete Abel-Jacobi maps. 
This unexpected link is brought about via crystallographic tight frames 
associated with finite graphs. The final section presents a link between 
discrete Abel-Jacobi maps and standard realizations. 

Not surprisingly, the materials here have something to do with, 
not only the legacy of Eukleides-Archimedes—Kepler’s achievements in 
polyhedral geometry,® but also with the geometry of numbers and the 
theory of quadratic forms because of the crucial role played by lattice 
groups in crystallography. 


6 Legend has it that Pythagoras derived the notion of regular polyhedra from the 
shape of a crystal. 
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Before leaving the introduction, let us fix a few notations used 
throughout. We express a matrix A by (aj) for simplicity when the 
(i, jt entry of A is ajj. The transpose of A = (aj;;) expressed by tA is 
the matrix whose (1, jth entry is a;;. The trace of a square matrix A, the 
sum of the diagonal entries of A, is denoted by tr A. The determinant of 
A is denoted by det A. 

Given a field K, we think of K@ as a vector space over K consisting 
of column vectors x = '(21,...,2q) with «; € K. The subspace spanned 
by vectors X1,...,X,% € K4 is expressed by (x1,...,X%)K- We denote by 
Mm,n(K) the set of all m x n matrices whose entries belong to K. We 
also use the notation M,(K) for Mnjn(K). The identity matrix (6;;) in 
Mn(K) is denoted by In, or simply I. 

A matrix A € Mm,(K) is identified with the linear operator of K” 
into K™ given by x t+ Ax. For a linear operator T : K" —> Kk™, 
the kernel of T is written as ker T. The image of T is denoted by 
Image T. When the matrix A € Mmjn(K) consists of column vectors a; 
(¢=1,...,n), we write A= (aj,...,an). 

The (standard) inner product (x,y) of two vectors x = '(a1,...,2p), 
y = “(y,---. Yn) in R” is xy = yy, +--+» + 2nYn- The norm |x|] is 
(x,x)!/?, For a subspace W C R”, the orthogonal complement {x € 
R"| (x,y) =0 (y € W)} is denoted by W+. 
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2. Tight Frames 


Property (16.2) tells that vj,v2,v3 form a tight frame of R?, a ter- 
minology originally used in wavelet analysis. The basic philosophy of 
tight frames is that representations such as (16.2) are similar to an 
orthogonal expansion when considering an infinite dimensional Hilbert 
space such as L? (R4), and that one may have more freedom in choosing 
the v; to have desirable properties such as certain smoothness and small 
support properties that may be impossible were they to be orthogonal 
(see [15] for a pioneer work). Tight frames are intimately related to 
rank-one quantum measurements [19]. In the finite dimensional case 
they are seen in the study of packet-based communication systems 
(refer to [22] for instance), and also show up as spherical two-designs 
in combinatorics [41]. In this chapter, we will give a completely different 
view to tight frames. Our development is guided by the idea indicated 
in the Introduction. 
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Let us start with some fundamental facts on tight frames which are 
more or less known (cf. [40]). Only some rudiments of linear algebra are 
required to read this section. 

In general, a sequence of N vectors S = iW in R@ is said to be 
a d-dimensional frame of size N or simply frame if it generates R@. In 
this definition, some of v; allow to be zero or parallel. Given a frame, 
we may associate a linear operator (called the frame operator) S = Ss : 
R? —> R?¢ by setting 

N 
S(x) = Do (x, vi)vs, 
i=1 


which is symmetric and positive. The matrix for S is given by 


S=(v,...,vy) '(v1,...,¥w) = vilvi +++>+vy'vy, 
and hence 
N 
trS= IIvill?. (16.4) 
i=l 


A frame S is said to be a-tight (or simply tight) if S = alg with a 
positive q, 1.e., 
N 
Yo (x, viva =ax (xe R2). 
i=1 
In view of (16.4), if S is 1-tight, then 


N 
2 

yo Iivall? = 4. 

i=1 


Tightness (resp. 1-tightness) is obviously preserved by similar transfor- 
mations and permutations of subscripts 7 in v; (resp. by orthogonal 
transformations). Here two frames Sj = {uj} , and Sg = {vi} 1 are 
said to be similar if there exists an orthogonal transformation U of R@ 
and a positive number 4 such that uy =AU (vj) (@ = 1,...,N). If A=1 
in this relation, S; and S9 are said to be congruent. 

A tight frame appears in the following situation. 


Proposition 2.1. Suppose that a finite group G acts on R@ as 
orthogonal transformations, and let S = els be a frame in R* which 
is invariant under the G-action (precisely speaking, for any g € G, there 
exists a permutation o of {1,...,N} such that gvi = Voi). If the G-action 
on R®@ is irreducible, then S is a tight frame satisfying ee vi =0. 


To check this, we note that the G-action commutes with the frame 
operator S. Looking at eigenspaces of S, we conclude that S = a/q for 
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P; 


Figure 16.6 A regular polygon 


some positive scalar a, as claimed. For the claim ee v; = 0, we only 
have to notice that the left-hand side is a G-invariant vector. 

Using Proposition 2.1, one can prove that for points P,,...,Py 
(N > 3) in the plane R? forming a N-regular polygon with the barycenter 
O (Fig. 16.6), the vectors vy = OP,, .5VN = OPw yield a tight frame 
(of course, one can prove this by a direct computation). By the same 
reasoning, five Platonic solids (regular convex polyhedra) and thirteen 
Archimedean solids (semi-regular polyhedra) yield tight frames of R® 
(cf. [13]). 

Tight frames have a similar nature to the orthonormal basis seen in 
the following proposition. 


Proposition 2.2. The following three conditions are equivalent to the 
1-tightness of a frame S = {Vel as respectively. 
N 
(1) (vi, x)? = |x? «eR, 


N 
(2) Sowa. x)viy) = y) Gy eR, 


N 
(3) Y(Tvj,¥5) = tr T for any linear transformation T of R¢. 


Proof. We only show that (3) implies the 1-tightness because the other 
claims are easy to check. Consider the operator T defined by T(x) = 
(x, y)y. Evidently tr T = |ly||? and 


Applying the equality (3), we have Day, v;)? = |ly||?. Thus by (2), S 
is 1-tight. 
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Recall that the 1-tight frame {v1,v2,v3} mentioned in Introduction 
was obtained as the projected image of an orthonormal basis of R°. This 
is true for general 1-tight frames; that is, any 1-tight frame S = fea 1 
in R¢ is a projected image of an orthonormal basis of R‘. To see this, 
we shall introduce an auxiliary operator (matrix). Given a frame S = 
Cane define the frame projection P = Ps : RN —> R¢4 by 


P('(a,...,0n)) = avi +++: + nv, 


i.e., P is the linear operator characterized by P(f;) =v; (¢ =1,...,N) 
where {fi} , is the fundamental basis of RY. The matrix for P is nothing 
but (v1,...,vy). Therefore, we have S = P *P. On the other hand, the 
matrix for ‘PP is the Gramm matrit Gs = (vi, vj)). 

The next proposition is considered a special case of Naimark’s dilation 
theorem in the theory of quantum measurements. 


Proposition 2.3. The following four conditions are equivalent: 
(1) S is 1-tight. 
(2) P= Ps satisfies P'P = I. 
(3) P is orthogonal in the sense that the restriction P| (ker P)~ : 


(ker P)* —> R®@ is an isometry (i.e., it preserves the inner 
products). 

(4) "PP RN —+ R% is a orthogonal projection, or equivalently the 
Gramm matrix satisfies G2 = Gs. 


Proof. (1) = (2) is obvious. To show (1) = (3), suppose that S is 
1-tight. Since P ‘P = I, we find 


(P(x), "P(y)) = (P P(x), y) = (xy) 

and 
(P'P(x), P'P(y)) = (x,y). 

Hence 

(P'P(x), P'P(y)) = (Pw), ‘P(y)). 
Since (ker Pp) = Image ‘P, we conclude that P| (ker Py ; (ker Pp) — 
R? is an isometry. 

Next suppose that P| (ker P)* ‘ (ker P)~ —> R? is an isometry. Again 

using (ker P)* = Image’ P, we have 

(P'P(x), P'P(y)) = (P(x), Py), 
or equivalently 

(57(x),y) = (S(),y). 

This implies S? = S, and hence S$ = J. 
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(1) = (4): If S is 1-tight, then 
((PP)(PP) ='"P(P'P)P ='PP. 


Conversely if (‘PP)? = *PP, then P'PP = P since * P is injective. But 
P is surjective, so P'P = I. 
For a frame S of R?, the (N — d)-dimensional subspace W(S) = ker Ps 


of R% is called the vanishing subspace for S, which obviously depends 
only on the similarity class of S. 


Proposition 2.4. (1) For any subspace W Cc RN of dimension N — d, 
there exists a d-dimensional tight frame S of size N such that W = 
W(S). 

(2) Two tight frames S1,S2 are congruent if and only if W(S1) = 
W (So). 


Proof. (1) Let p: RN —+ RN be the orthogonal projection onto Wt. 
Choosing an isometry i: W+ —> R¢, we put P = ip, which is obviously 
a frame projection satisfying (3) in Proposition 2.3 such that W = ker P. 

(2) If ker Ps, = ker Ps,(= W), then since Pj = Ps,|Wt: Wt —> 
R?@ is isometry, U = P, P27! is an orthogonal transformation such that 
Ps, = UPsy- 


The symmetric group Gy of {1,2,...,N} acts on RY as axis 
permutations‘, i.e., 


o(f;) =f (6 € Gy). 


The automorphism group Aut(S) of a 1-tight frame S = (eae is 
defined to be the subgroup of Gy consisting of 0 € Gy satisfying 
o(W(S)) = W(S). By virtue of Proposition 2.4 (2), there is an injective 
homomorphism U : Aut(S) —> O(d) such that U(o)vj = Voviy. 

If Aut(S) acts transitively on {1,2,...,N}, then S is said to be 
isotropic’. 

Tight frames associated with regular polygons, Platonic solids, and 
Archimedean solids are isotropic. Among them, the equilateral triangle 
and the regular tetrahedron are very special in the sense that Aut(S) 
agrees with Gj. We shall say that S with the property Aut(S) = Gy 
is strongly isotropic. 

The higher dimensional analogue of the equilateral triangle and the 
regular tetrahedron is the equilateral simplex, a simplex A in R¢@ whose 


7 This is called the standard representation of Gy. 

8 An isotropic tight frame Wes is uniform in the sense that ||vz|| = --- = ||vy||. The 
notion of uniform tight frame appears in various applications. The classification of 
isotropic frames is obviously related to that of subgroups of Gy acting transitively 
on {1,2,...,.N} which has been pursued for over a century since the 1860 Grand 
Prix of the Académie des Sciences. 
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edges have equal length. Suppose that its barycenter is the origin O, and 
let P1,..., Pa41 be its vertices. The phates group Gq41 acts on A 
as seGeorek transformations. Thus, {viet | defined by v; = OP; isa 
strongly isotropic tight frame. 

Conversely we have the following. 


Proposition 2.5. Let S be a strongly isotropic 1-tight frame in R4 
(d > 2). Then S is the frame associated with the equilateral simplex. 


Proof. Since Aut(S) = Gy, invariant subspaces for the Aut(S)-action 
on RN are either W = {*(2,2,...,2)|x € R} or W+ (indeed, RY = W@ 
W- gives the irreducible decomposition for the Gy-action). Because 
d > 2, the vanishing subspace for S must be W, and N = d+ 1. Then 
the frame projection P is identified with the orthogonal projection of 
R4+! onto W+. Hence 


1 d+1 
vi = Pi) =f - el Dak 

i=1 
Since (vi, vj) = dy — gz) if vi = OP;, then (P)P;)? = IIlvi — vy ll? = 
2 (« #3). We thus conclude that Pi,...,Pqa41 is the vertices of the 
equilateral simplex. 


The proof of the following proposition is left as an exercise for the 
reader. 


Proposition 2.6. If {OP; OP;}4t) is a tight frame of R®, and Dea 
OP; = 0, then Pj, Po,..., Paz. be the vertices of an equilateral simplex. 


We go back to the general case. In terms of matrices, what we have 
said in Proposition 2.3 is rephrased as 


Proposition 2.7. The row vectors in a matrix 


a1 a9 ad 
az1 422, ++ Aad 

A= et € Mn,a(R) 
Q@N1 @N2 -*: Nd 


give rise to a 1-tight frame if and only if the column vectors of A form 
an orthonormal system (i.e. ‘AA = Iq). 


For if we define S = {vi}, by writing ‘A = (v1,..., vy), then A is 
the matrix for Ps. 
Writing A = (aj,...,aq), we find that ker P = (aj,...,aq)p+ because 


(aj,...,aq)R+ = (Image A)t = ker ‘A. 


Thus we have the following proposition which rephrases Proposition 2.4. 
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Proposition 2.8. Given an (N — d)-dimensional subspace W of RN, 
there exists a solution A € My a(R) of the equations 


‘AA= Iq, (16.5) 
‘Ax =0 (xe W). (16.6) 


If Ay, Ag are solutions, then there exists U € O(d) such that Ay = AU. 


We now give an explicit parameterization of congruence classes of 
1-tight frames. We denote by Ty(R®%) the set of congruence classes 
of 1-tight frames {vi}, in R®. Proposition 2.7 tells us that the set of 
1-tight frames is identified with the Stiefel manifold V a(R) (={Ae 
My ,a(R)| ‘4A = Iq}). The action of the orthogonal group O(d) on 
Va(R%) by Ae Vg(RX) BAUM € Va(R) (U € O(d)) is compatible 
with the action of O(d) on the set of 1-tight frames because 


*(AU"!) = U‘A= U(™M,..., vw) = (U(V1),..., Uy). 


Therefore, Ty(R%) is identified with the quotient space Vq(R )/O(d), 
where the canonical projection @ : Vq(RY) —_— Vq(R)/O(d) = 
Ty(R2) coincides with the map which brings 1-tight frames to their 
congruence classes. 

The quotient space Vq(RN )/O(d) is nothing but the the Grassman- 
nian Gr v—d( Rs i.e., the set of (N — d)-dimensional subspaces of RY. 
Therefore, T n(R®) is identified with Gry_q(RX ), which is also iden- 
tified with Grg(R%) via the correspondence W +> W+. Under these 
identifications, the canonical projection g turns out to be nothing but 
the map Vq(RN ) — Gra (RY) giving the well-known structure of an 
O(d)-principal bundle. 

If we ignore the order of vectors in tight frames, it is natural to take 
up the quotient space Gy\Ty(R?) ( = 6y\Va (RY )/O(d)) where the 
symmetric group Gy acts on Va(R¥) by (aij) (4,-1(4)3) (o € Gy). 
For a 1-tight frame S = {vies the isotropy group of the point 
*(v1,...,vy) O(d) € Vg(R%)/O(d) coincides with Aut(S). 

In view of Proposition 2.8, the Stiefel manifold Vq(RY) is regarded 
as a “quadric” in My q(R), and a 1-tight frame with the vanishing 
group W is obtained as a point in the intersection of the quadric 
Vq(R) and the subspace {A € Mn,a(R)| ‘Ax = 0 (x € W)}. 
Such locution turns out to become more natural when we consider the 
set of “oriented” congruence classes of two-dimensional tight frames 
(see Sect. 4). Here the set of orientated congruence classes of 1-tight 
frames is Ty(R¢) = V4(R%)/SO(d), which is identified with the 
oriented Grassmannian Grg(RX ), the manifold consisting of all oriented 
d-dimensional subspaces of RN. It is a double cover over Grq(RY). 

We close this section by giving a simple remark on Gramm matrices 
associated with frames. 
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Proposition 2.9. Two frames S1 and S2 are congruent if and only if 
Gs, = Gs,- 


Proof. It suffices to show that if b1,...,by span R4, and 


(bj,b;) = (c;,c¢;) (@,j =1,...,N) 


for vectors c1,...,¢y in R®, then there exists an orthogonal matrix U € 
O(d) such that ¢; = Ub;. Certainly this is true when b1,...,by (N = d) 
is a basis. In the general case, we take a basis b;,,...,b;,, and U € O(d) 


with Ci, = Ubi, . Then 
(cj, ,€x) = (by,, bg) = (Ub; , Ubg) = (cy, Ubx), 


from which it follows that c, = Ub, for every k, as required. 


Proposition 2.3 (4) combined with the above proposition tells us that 
Ty (R%) is identified with 


{G€ My(R)| 'G=G, G2 =G, mak G = d}. 


3. Crystallographic Tight Frames 


We come now to the principal subject of this chapter. Recall that the 
tight frame {v,,v2,v3} mentioned in Introduction forms a crystalline 
pattern if and only if it generates a lattice in R?. Having this fact in 
mind, we shall introduce the notion of crystallographic tight frame. 

Before going into the subject, we review some items in the theory 
of lattice groups, which are often used in the rest of this article. In 
general, for a lattice £ in an n-dimensional vector space M with an inner 
product (-,-), we denote by vol(//L) the volume of the flat torus M/ZL. 
If {atorus;,...,an} is a Z-basis of £L, then vol(M/L)? = det ({a;, a;)). 
Further we have 


vol(M/L1) = |£2/Li|vol(M/L2) 


for two lattices £1, £2 with Ly C Lo. 
We denote by £L* the dual lattice of L; namely L* = {x € M| (x,y) € 
ZA(ye Ls We observe 


vol(M/L*) = vol(M/L)7!. 
Therefore, if £ is integral, i.e., LC Li, then 
IL# /L| = vol(M/L)?, (16.7) 


in particular, vol(M/L)? is an integer. 
A frame S = Wiles of R@ is said to be crystallographic if it generates 
a lattice in R¢, or what is the same is that Ls = Image pg is a lattice, 
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where ps = Ps|ZN : ZN —+ R4. We shall designate Lg as the periodic 
lattice for S. 

The vanishing subgroup H(S) associated with a frame S is defined to 
be the kernel of the homomorphism pg, i.e., '(ki,..., ky) € H(S) if and 
only if kivy +---+kyvy =0. 

We have a sufficient condition for a frame being crystallographic 
(this turns out to be a necessary condition if the frame is 1-tight; see 
Proposition 4.2). 


Lemma 3.1. A frame S = {¥iey is crystallographic provided that 
the Gramm matrix Gg is essentially rational, i.e., (vi,vj) € AQ (i,j = 
1,...,N) witha > 0. 

Proof. Put vj = (vi,vj). Suppose that fyile does not generate a 
lattice. Then 0 is an accumulation point in R® of the subgroup 


N 
Yo kivil kee ZG=1,...,N)}, 
i=1 


so that one can find a sequence {%pj}°_, € Z such that 

N N 

S Inivi ~ 0 and im. Ss Inivi = 90. 
iad 


i=1 


On the other hand, 


N 2 N 
ps TniVil) = oa Vig Tni ny - 
i=l ij=l 


Take a positive integer N such that A Noy € Z. Since {A7'N are 
uj tj 2;| tj € Z} is discrete, we have a contradiction. 


The period lattice £¢g is just the set of vertices of the network obtained 
as the projected image by the frame projection of the hypercubic lattice. 
Here hypercubic lattice means the net associated with the N-dimensional 
standard lattice Z", a generalization of the square and cubic lattices. 

The hypercubic lattice as a graph is the Cayley graph X associated 
with the free abelian group ZN with the set of generators {f1,...,fn} 
(remember that {f;} is the fundamental basis of RY ). Thus the quotient 
graph X/H(S) by the natural H(S)-action on X is the Cayley graph 
associated with the factor group Z /H(S) with the set of generators 
{f +H (S)}®,. Furthermore, X is the maximal abelian covering graph 
over the N-bouquet graph, the graph with a single vertex and N loop 
edges. The projected image of the hypercubic lattice can be thought 
of as a (periodic) realization of the abstract graph X/H(S) (possibly 
having degenerate edges, multiple edges, and/or colliding vertices when 
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realized in R4%, like the square lattice in Figure 16.3 (a)). The map 
(graph morphism) w of X onto X/H(S) associated with the canonical 
homomorphism — ZN /H(8) turns out to be a covering map, and 
is compatible with the frame projection P of the hypercubic lattice onto 
the projected image. 


X —— RN 


o| |p 
X/H(S) —— R¢ 


This story is much generalized in Sect. 7 in terms of topological crystals 
and their standard realizations. 

There are plenty of sources of crystallographic tight frames. 

(1) The (isotropic) tight frame associated with the N-regular polygon 
is crystallographic if and only if N = 3, 4,6. The two-dimensional crystal 
pattern for N = 3,6 is the regular triangular lattice. For N = 4, we have 
the square lattice. 

(2) Platonic solids which yield crystallographic tight frames are the 
tetrahedron, cube, and octahedron. The tetrahedron and cube yield the 
net associated with the body-centered lattice (Figure 16.2 (e)), while the 
crystal net corresponding to the octahedron is the cubic lattice. 

Among all Archimedean solids, truncated tetrahedron, cuboctahedron, 
and truncated octahedron yield crystallographic tight frames; others do 
not! (Figures 16.7 and 16.8). 

(3) An advanced example of tight frames is derived from root systems. 
For the convenience of the reader, let us recall the definition [24]. 

A root system in R® is a finite set ® of nonzero vectors (called roots) 
that satisfy the following conditions: 


1. The roots span R?. 

2. The only scalar multiples of a root x € ® that belong to @ are x 
itself and —x. 

3. For every root x € ®, the set @ is closed under reflection through 
the hyperplane perpendicular to x. 

4. (Integrality) If x and y are roots in ®, then the projection of y onto 
the line through x is a half-integral multiple of x. 


The reflection ox through the hyperplane perpendicular to x is 
explicitly expressed as 


9 The reason why restricted polygons and polyhedra appear in (1), (2) is derived 
from the following general fact: If a finite subgroup G of GL,g(Z) contains an 
element with order n, then g(n) < d, where y(n) is the Euler function, the number 
of positive integers k with 1 < k < n and gcd(n,k) = 1. Thus for d = 2,3, the 
possible order is 2,3, 4, or 6. 

10 We may use Lemma 3.1 to check this (see also Proposition 4.2). 
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KS 


Figure 16.7 Truncated tetrahedron and truncated octahedron 


Figure 16.8 A3 and cuboctahedron 


Ys X) 
ox(y) =y-2 x2 x 


The group of orthogonal transformations of R? generated by reflec- 
tions through hyperplanes associated to the roots of @ is finite and called 
the Weyl group of ®. 

A root system @ is called irreducible if it cannot be partitioned into 
the union of proper subsets such that each root in one set is orthogonal 
to each root on the other. If ® is irreducible, then the Weyl group acts 
irreducibly on R¢. Therefore in view of Proposition 2.1, ® (under any 
ordering of roots) gives a tight frame. ® is redundant in the sense that 
it contains both x and —x. One can take subset @* such that ® = 
@+U—-t and 6+N—-G* = G (the set of positive roots in the root system 
gives such @*). When @ is irreducible, * obviously gives a tight frame. 

The tight frames given by ® and ®* are crystallographic because ® 
generates a lattice (called the root lattice).'! 

It is known that there are four infinite families of classical irreducible 
root systems designated as Ag (d > 1), Bg (d => 2), Cg (d = 3), and Dg 
(d > 4), and the five exceptional root systems Ee, E7, Es, F4, and Gb. 
Among them, Ag, Dg, E¢, E7, Eg give isotropic frames (cf. [17]). The 
crystal nets for Ag and A3 are the regular triangular lattice and the net 
associated with the face-centered lattice (Figure 16.2 (d)), respectively. 


11 @ and ot yield the same crystal net. 
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Figure 16.9 The net for By and Go 


The tight frame associated with A3 coincides with the one coming from 
cuboctahedron (Figure 16.8). 

Interestingly Az (and the net associated with A3) comes up in 
crystallography in various forms; say, in the hexagonal arrangement 
that gives the densest sphere packing, !? the diamond crystal, and the 
diamond twin (see Sect. 6). 

Figure 16.9 depicts the nets for Bp and Gz (compare with Figure 16.5). 

(4) The Leech lattice A discovered by John Leech in 1967 is a highly 
symmetrical lattice in R?4 characterized by the following properties 
[10], [30]: 


e It is self-dual, ic, A* = A. 

e It is even, i.e., the square of the length of any vector in A is an even 
integer. 

e The length of any nonzero vector in A is at least 2. 


The group (called the Conway group Coo) of orthogonal transfor- 
mations which preserves A permutes transitively the 196560 vectors 
x € A with ||x|| = 2. The action of Cog on R*4 is irreducible, so that 
@® = {x € A| ||x|| = 2} gives a crystallographic, isotropic tight frames. 
It is known that Cop modulo its center (designated as Coz) is a simple 
group of order 4157776806543360000 = 27! . 39.54.77. 11-13-23. The 
Leech lattice is not a root lattice, but considered a kin of the exceptional 
root lattice Eg in view of the fact that Eg is the unique self-dual even 
lattice in eight-dimension. 

(5) Let (2, y,z) be a primitive Pythagorean triple; namely x, y, z are 
coprime positive integers satisfying 1? + y? = z?. Put 


n(n (2)-m=G) (2) 


12 This is the Kepler conjecture for which Thomas Hales gave a proof in 1998. 
Johaness Kepler stated this conjecture in the short pamphlet entitled New- Year’s 
gift concerning siz-cornered snow (“Strena Seu de Nive Sexangula” in Latin) in 
1611. 


Topics ON MATHEMATICAL CRYSTALLOGRAPHY 493 


Figure 16.10 A Pythagorean lattice 


One can check that {vi}t_y is a crystallographic tight frame whose 
vanishing subgroup is Z*(z,—2,0,y) + Z'(0, y,—z,x). The crystalline 
pattern associated with this tight frame is what we call a Pythagorean 
lattice [34]. Note that the tight frame {+v}4_, consisting of eight vectors 
is isotropic. See Figure 16.10 in the case (az, y, z) = (3, 4,5). 

Related to Pythagorean lattices is the notion of the coincidence 
symmetry group, which originates in the theory of crystalline interfaces 
and grain boundaries in polycrystalline materials. 

In general, the coincidence symmetry group for a lattice £ in R@ is 
defined as 


G(L) = {g € SO(d)| gL ~ L}, 


where £1 ~ £2 means that two lattices £1 and Lo are commensurable, 
ie., £1. Le is a lattice. Because ~ is an equivalence relation, G(L) 
is actually a subgroup of SO(d). Note that the symmetry group {g € 
SO(d)| g£ = L} is always finite, while G(£) could be infinite in general. 
In the special case £L = Z4, we get G(Z4) = SO(d)N M,(Q), which is a 
dense subgroup of SO(d) (see the remark below). In particular, 


ca) = |(? syle reat pce Ql. 


If p = «/z,q = y/z with a primitive Pythagorean triple (2, y, z), then 


we have 
y/z uf/z )- 
(6) Let £° be the two-dimensional lattice with the Z-basis a1, a2 such 


that |la,||? = |lag||? = 1, a; -ag = —1/2 (note that £° is a lattice whose 
translational action preserves the honeycomb). We then have 


Sty us8 

cic =| ("a A) [pm an paca}, 
37 4 P— 934 

which is also a dense subgroup of SO(2). 
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Put vi = a1, v2 = ag, v3 = —ay — ag. Obviously {vj1, v2, v3} is 
a 1-tight frame associated with the equilateral triangle whose period 
lattice is C°. For any g € G(L"°), the frame {v1, v2, v3, 9V1, V2, gVv3} is 
crystallographic and tight. 

Here are several remarks related to the examples (5), (6). 

(1) The coincidence symmetry group G(L) is dense in SO(d) © the 
lattice £L is essentially rational, i.e., there exists a positive A such that 
(£,£L) C AQ. To sketch the proof, we first note that G(@lL) = G(L) for 
any lattice and a > 0. Thus, in the proof of the implication <, one 
may assume that L is rational. Selecting a Z-basis {aj,...,aq} of £L, we 
consider the symmetric matrix S = '(a,,...,aq)(a1,...,aq), and define 

La(S) = {X = (xj) € Ma(Q), 'XS + SX = O}, 
which is a vector space over Q of dimension ae) because S is rational. 
Then putting 


Q(X) = (a1,...,ag) I — XI + X)"(ar,...,ag)*, 


we have an injective map g : L@(S) —> G(L) whose image is dense 
in SO(d) (g is what we call Cayley’s parameterization.)!> From this 
argument, it follows that G(L£) is dense in SO(d). 

Conversely suppose that G(£) is dense in SO(d), which is equivalent 
to the condition that in the rotation group SO(S) = {A € GZ,q(R)| 
‘ASA = S,detA > 0} for the symmetric matrix S, the subgroup 
SO9(S) = {A € GLa(Q)| ‘ASA = S, det A > 0} is dense. We easily 
find 


{S’ € Mg(R)| ‘AS’A = S$" (A € SO(S)), *S’ = S'} =RS 


(this is equivalent to say that any SO(d)-invariant symmetric bilinear 
form on R¢ is a scalar multiple of the standard inner product). On the 
other hand, the equation for S” given by 


'AS'A = S' (Ae S0Q(5)), *S’ = 8’ 


reduces to a homogeneous linear equation with rational coefficients 
having a nonzero real solution; say, S, so that one can find a nonzero 
rational solution So. Since SOg(S) is dense in SO(S) by the assumption, 
we conclude that ‘4S9A = So (A € SO(S)), So = So, thereby So =AS 
for some A #0, and S being essentially rational. 

(2) For two frames S} = irra naan and Sp = way we define the join 
Si V Sp = {wi}it7™ by 


u; (¢=1,...,M) 
wi = 


vim @=M+1,...,M+QN). 


13 The idea, originally given by A. Cayley, dates back to 1846. 
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(i) (ii) (iii) 
Figure 16.11 Two-dimensional crystallographic isotropic tight frames 


If both S; and Sg are tight frames, then so is Sy V Sg. In order that 
S1 V SQ is crystallographic, it is necessary and sufficient that the period 
lattices £5, and Ls, are commensurable. Obviously the period lattice 
of Sy V So is Ls, + Lg,. 

(3) Because finite subgroups of O(2) are cyclic or dihedral (cf. [33]), 
two-dimensional crystallographic isotropic tight frames are classified as 
follows (note that the order of a cyclic subgroup must be less than or 
equal to 6). 

(a) Frames (i), (ii), (iii) depicted in Figure 16.11. 

(b) Joins of each frame in (a) and its rotation by an element in the 
coincidence symmetry group (more precisely G (Z?) for (ii) and G(L>°) 
for (i) and (iii)). 

(4) In the terminology of crystallography, the intersection £1 £2 for 
commensurable lattices £1 and £9 is called the coincidence site lattice 
(CSL), while the sum £1 + £2 is called the displacement shift complete 
lattice (DSC lattice). For more about the CSL theory, refer to [23], [1], 
[44], [45]. 


4. Parameterizations of Crystallographic Tight Frames 


If a frame S is crystallographic, then the vanishing group H(S) is 
obviously a direct summand!* of Z, and rank H(S) = N —d. Therefore 
H(S) is a lattice in the vanishing subspace W(S) = ker(Ps : RY - R4), 
and hence the subspace H(S)p of RN spanned by H(S) coincides with 
W(S). 

Conversely suppose that H(S) is a direct summand of Z'. Take a 
Z-basis {c1,..., ¢v} of ZN such that {Cg41,---,¢y} isa Z-basis of H(S). 
Then {ps(c1), swag Os lea) is a Z-basis of Ls (remember that ps = 
Ps|ZN ). Therefore S is crystallographic. 


14 A subgroup B of an additive group A is called a direct summand of A if there 
exists another subgroup B’ such that A = B @ B’, i.e., any element a € A can be 
expressed uniquely as a = 6 + b! with b € B,b! € B’. Inthe case A=Z, Bisa 
direct summand if and only if the factor group A/B is free abelian. 
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Proposition 4.1 (Existence and Uniqueness). Let H be a direct sum- 
mand of (a of rank N — d. Then there exists a unique crystallographic 
1-tight frame S of R4 up to congruence such that H(S) = H. 


Proof. Let Hp be the subspace of RY spanned by H, and let P : 

RN —+ R®@ be the frame projection giving a 1-tight frame whose 

kernel is Hg. To prove H(S) = H, note H(S) = ker PAZ" D> H 

and rank H(S) = rank H. Hence we have a finite subgroup H(S)/H of 

ZN /H. Since H is a direct summand, H(S)/H = {0}. 
The uniqueness follows from Proposition 2.4. 


There is a one-to-one correspondence between direct summands of 
rank k in Z" and subspaces of dimension k in Q%. Indeed we have 

(1) Let H c ZN be a direct summand, and let Hg be the subspace of 
the vector space Q% over Q spanned by H. Then Hg ZN =H. 

(2) Let W is a subspace of Q™ of dimension k. Then H = WNZN is 
a direct summand of Z of rank k. 

In fact, (1) is proved in the same way as the proof of the above theorem. 
For (2), suppose that Z /( WZ") is not free. Then there exists « €¢ ZN 
not contained in WA ZN such that nz € WAZ. This implies that 
x € W, and hence « € WN ZN, thereby a contradiction. 

We denote by TS} (R¢) the set of congruence classes of crystallographic 
1-tight frames {vi}, in R®@. Proposition 4.1 together with the above 
remark tells us that TS) (R®) is parameterized by Gry_q(Q¥). 

Gry_g(Q®) is thought of as the set of “rational points” on the 
complex Grassmannian Gr n—q(C% ) by considering it as a subvariety 
of the complex projective space P@(C) (M = () — 1) by means 
of the Pliicker embedding. Here we recall the general definition of 
rational points. Let K be an algebraic number field, and let V be a 
projective algebraic variety, defined in some projective space P”—1(C) by 
homogeneous polynomials f1,..., fm with coefficients in K. A K-rational 
point of V is a point [2,...,2n] in P’-1(K)( c P”™-1(©) that is a 
common solution of all the equations fj = 0. 

Given W € Gry_g(Q4), we shall explicitly construct a 1-tight frame 
whose vanishing group is H = WaZzn (this is the stage where quadratic 
Diophantine equations show up). For this sake, we select nonzero vectors 
ni,...,0q of W such that nj ¢ ZY, (nj,x) = 0 for every x € H, and 
(nj,nj) = 0 for 7 A j (we may do this because for a subspace V of 
Q, the “rational” orthogonal complement Vt = {x € QY| (x,y) = 
0 (y € V)} satisfy QY = V @ V+; then we argue by induction for the 
construction of n;). Write Inj 127 = m? Dj with a square free positive 
integer D; and a positive integer m;. Then putting a; = (mj/D;)~!m, 
we obtain A = (aj,...,aq) € Vq(R). If we define v; by the relation 
(v1,...,vy) = °A, then we obtain a 1-tight frame S = {vie in R@ 
whose vanishing group is H. 
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To express the matrix A more explicitly, we write nj = '(n;,....nNi)- 
Then 
1 ny 1 nj 1 ma 
VDim VD2 me V Da Ma 
a ad 
A= V/ Dd, m4 a/ Do m2 a/ Dg Md 5 (16.8) 
1 ny 1 nyo 1 Na 


VD, m = JSD2 m2 _ V Da Ma 


where njj and mj; satisfy the following Diophantine equation!® 


mi? +--+ nui? = Dim? G=1,...,0), 
Mim +---+nninnj =0 (2479). 
Taking a look at (16.8), we have 


Proposition 4.2. The Gramm matrix for a crystallographic 1-tight 
frame is rational. 


Conversely if we start with a matrix A € Vq(R%) of the form (16.8), 
then the 1-tight frame associated with A is crystallographic in view 
of Lemma 3.1. From this observation, it also follows that the set of 
congruence classes of 1-tight frames is parameterized by 


{Ge My@Q|'G=G, G?=G, rank G = d}. 
We also have 


Corollary 4.3. The period lattice Ls for a crystallographic 1-tight 
frame S is rational in the sense that (Ls5,£L5) CQ. 


We now restrict ourselves to the case d = 2 and N > 2, and consider 
Ty(R2) = = Vo(RX )/SO(2), the set of oriented congruence classes of 
1-tight frames in the plane R?. As remarked before, Vo(R )/SO(2) 
is identified with the oriented Grassmannian Gro (RN ), which is also 
identified with the complex quadric 


Qn = {[z1,-.-, 2] E pte) 27 Sear Zn = 0}. 


This routine procedure is carried out by the map 


ay by 
az bg 

oe Vo(R’) > (ay + b)V—1,...,an + bnv—-l) € cr. 
an bn 


15 A trivial remark is that a ruler-compass construction of the 1-tight frame is possible 
once this Diophantine equation would be is solved. 
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which yields a one-to-one correspondence between T N (R?) and Qn (note 
that the quotient of Qy by the conjugation [z1,..., zy] [Z,.-.-, ZN] 
is identified with Ty (R?)). 

For a direct summand H of rank N — d in Z, define the subspace 
Ly in PN-!(C) by setting 

Ly ={[a,-.-,zn] € PX'©)| ka t+-+-+kwzn =0 
for every ‘(k,..., kw) € Hy}. 

The intersection Qy N Ly consists of two points (one is the conjugate 
of another), both of which correspond to an oriented similarity class of 
a crystallographic tight frame S with the vanishing subgroup H = Hs. 

Applying the above observation in the general case, one can easily 


show that there exists a square-free positive integer D such that the two 
points in Qn N Ly are Q(V/—D)-rational points on Qy. We thus have 


Theorem 4.4. The set 


Qn NL) PX (QW=D)) 
D 


is identified with the set of oriented similarity classes of two-dimensional 
tight frames of size N. 


For an illustration, we shall make a brief excursion into Q(./—D)- 
rational points on Q3. These rational points are related to two- 
dimensional crystal patterns obtained as projected images of the cubic 
lattice mentioned in the Introduction. 

The crystallographic 1-tight with the vanishing group H=Z 
*(n1, n2, n3) corresponds to the point [z1, 22, 23] € P?(C) given by 


a = ng? + ng”, 
2 = —mng + ¥—(m? + np? + 73”), 


23 = —nng = V—(m2 + no? + 032). 


Therefore, D is the square-free part of ny? + no? + 3. 
A question arises: For which square-free D, does the quadric Q3 have 
a Q(./—D)-rational point? The answer is given in the following. 


Proposition 4.5. The quadric Q3 has a Q(/—D)-rational point if and 
only if D is not of the form 8k +7. 


This is, as easily conceived and proved below, a consequence of the 
theorem of three squares due to Legendre!® (1798), which says that a 
positive integer n can be expressed as the sum of three squares if and 
only if n is not of the form 4°(8k + 7). 


16 Legendre’s proof is not complete. It is Gauss who gave a complete proof. 
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From what we observed above, it follows that the quadric Q3 has a 
Q(V/—D)-rational point if and only if the equation ny2+n22+n32 = Dm? 
has a nontrivial integral solution nz, n2, 3, mM. 

We first show that if D is not of the form 8k +7, then ny? + no? + 
n32 = D has an integral solution nj, n2,n3. To this end, suppose that 
ny? + ng? + n3? = D has no integral solution. Then D = 4°(8k + 7) for 
some £ and k by invoking Legendre’s theorem. Since D is square-free, D 
must be of the form 8k + 7. 

Next suppose that D is of the form 8k+7. If ny?-+n2?+n32 = Dm? has 
a nontrivial integral solution n1,n2,3,m, then writing m = 2'(2s + 1), 
we have 


Dm? = 4°(8k + 7)(2s + 1)? = 4°(8(8kt + 7t +k) +7), 


where ¢t = s(s + 1)/2. This contradicts to Legendre’s theorem. 


5. A Height Function on the Rational Grassmannian 


The rational Grassmannian Gryv_q(Q) is dense in Gry_ (RX); in 
particular there are infinitely many congruence classes of d-dimensional 
crystallographic 1-tight frames of size N. This section is devoted to 
a brief explanation of how to count congruence classes, with some 
excursions into a characterization of crystallographic tight frames by 
means of a certain minimal principle. The tool that we employ is a 
natural height function on the rational Grassmannian. 

We define the height function h on Gr;(Q) in the following way. For 
W € Gr; (Q), we let H= WN ZN, which is a direct summand of Z as 
noticed before, and is an integral lattice in Hp(= Wp). We then put 


h(W) = vol(He/H)( = |H* /H|/?), 


where the inner product on Hp is the one induced from the standard 
inner product on RN. 
The function h deserves to be called a height function because we have 


Proposition 5.1. For any c > 0, there are only finitely many W € 
Gr,(QX) such that h(W) < c. 


Proof. It suffices to prove that for every c > 0, there are only finitely 
many direct summands H of rank k such that vol(Hp/H) < c. To this 
end, we shall introduce the quantity c(H). For a Z-basis J = {c1,...,cz} 
of H, we put 


c(J) = max((|e1|,---, lex), 
and define 
c(A) = min c(J). 
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It is straightforward to check that there are only finitely many direct 
summands H of rank k such that c(H) < c. Therefore, it suffices to 
prove 


Cc(A) < vol(Hp/H) 


with a positive constant C' not depending on H. For this sake, we invoke 
the fact that there exists a Z-basis (called a reduced basis) w1,...,uUz of 
Hf satisfying 

(i) [furl] < [Ixl] for all x € H\{0}, 


(ii) if w1,...,uj—1,x is a part of a Z-basis of H, then ||u,|| < ||xll. 


This property of u,,..., uz implies ||uy|| < ||ueal| < --- < ||ug||. Moreover 
a theorem in the geometry of numbers due to Minkowski asserts that 
there exists a positive constant Cz such that 


uj||--- lug || < Cyvol(Ar/ A) 


(cf. [6]). By the definition of c(J), if J = {u1,..., ug}, then c(J) = |lugll, 
so we find c(H) < |lug||. Since |luj|| => 1 for i = 1,...,k — 1 (note 
H CZN), we have 


cd 


c(H) < |lugll < |luill--- llugll < Cevol(Hr/H). 


> 


This completes the proof. 
Remark 5.2. (1) By Hadamard’ inequality, we have 
vol(Hg/H) < |lei||--- llexll < e(J)* 
for any Z-basis J = {c1,...,¢%}; therefore 
vol(Hp/H) < c(H)*. 
(2) For a direct summand H = Z'(nj,...,ny) of ZN, 
h(Hg) = Vm? +-+- + ny?. 


Using this, we obtain the asymptotic formula 
1 
{We PN-!@Q) = Gr (QY) |A(W) < h}| ~ 5o(N) tonh® (h > 00), 


where wy = 2 %/2/7(1 + N/2), the volume of the unit disk in RY, and 
¢(s) is the Riemann zeta function. It is interesting to ask whether a 
similar asymptotic formula holds for Gr; (QY). 

The height function h(W) is closely connected with crystallographic 
tight frames. 


Proposition 5.3. vol(R?/Ls) = h(Hg)~!( = vol(Hg/H)~!) for a 
crystallographic 1-tight frame S = wae whose vanishing subgroup 
is H. 
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Proof. Let S = {vi}, be a crystallographic 1-tight frame of R4, and 
let P = Ps : RN —> R” be the frame projection. Again take a Z-basis 
{c1,..., en} of ZN as before such that {Cd41,---,¢n} is a Zbasis of 
the vanishing subgroup H(S). Note that | det(c1, Sos ,cn)| = 1 because 
(c1,...,¢y) € GLy(Z). Thus for the square matrix C = ((c;, ¢;)) with 
integral entries, we find that det C = (det(c1, vit ,en)) =H, 

Putting b; = P(c;) (¢ = 1,...,d), we have a Zbasis {b1,...,bg} of 
the period lattice Ls. Using ‘PP(c;) — ¢; € ker P, we may write 


N 
"Pibi)-—ci = D> figej- 


j=d+1 
N 
Substituting ‘PP(c;) = q+ y, fige; into (‘PP(ci), cg) = 0 (k= d+ 
j=d+1 


1,...,5), we have 


—(C;, Cx) _¥ fig Cj, Cx). 


j=d+1 


ca (Ou Cp 
Cot Cho}’ 
where Ci, € M,(R) and using these matrices of small size, we may 
compute the matrix F = (fj;) € Ma,n—a(R) as 


Now writing 


F= Sie O5e 


Furthermore, 


N 
(b;,b;) = (‘PP(ci), ej) = (« + a fuss) 


k=d+1 


= (Cj, Cj) is Sik (Ck, €5)- 


k=d+1 
Therefore, the matrix B = ((b;,b;)) € Mg(R) is computed as 
B= C+ FOg = Ci — CiaCxy' On (16.9) 


(this tells us that B is a rational matrix; thereby giving an alternative 
proof of Corollary 4.3). It is readily checked that 


Cu oe ( I O’\ _ as — O12 C55 Cai a) 
Cx Cho) \—Cy' Ca I O Coo)? 
so det C = det(Ci1 — Cie Ce C51) - det Cog = det B - det Coo. Because 


det C = 1, det B = vol(R4/Lg)”, and det Cy = vol(Hg/H)?, our claim 
follows. 
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The following gives a characterization of crystallographic 1-tight 
frames by means of a minimal principle. 


Proposition 5.4. For a crystallographic frame S = any in R® with 
the vanishing group H, we have 


N 
Yo lIvall? = d-vol(R4/L3)?/ 4h Hg)?! 4( = d-vol(R4/Ls)7/4|H* /H|!/4), 
i=1 
The equality holds if and only if S is tight. 

It should be noted that the quantity vol(R4/L5)~2/4 DN, |Ivill? 
depends only on the similarity class of the crystallographic frame S. 
Proof. For any positive symmetric matrix B € M,g(R), we have 


tr B > d(det B)'”*, 


where the equality holds if and only if B = alg for some a > 0 (this is 
easily deduced from the inequality of arithmetic and geometric means). 
Applying this inequality to B = ‘AA, we find that 


N 
Yo lvall? = d(det 85)1/4 
ol: 


for any frame S = {vite 1, where the equality holds if and only if S is 
tight. 

We now let S be the frame operator associated with S. Recall that 
S is symmetric and positive, whence one can find a (unique) symmetric 
positive operator S~!/? with (S~1/2)? = §-!. Then 


N 

yi soy \g~ V2, = g-l/2 Ys goes: vi)v 

i=1 

=o agg Wx =x. 

Therefore, U = {S~\/ Fae , isa 1-tight frame whose vanishing subgroup 
is obviously H. Thus py = S~!/2 pg and 

vol(R@/Image pu) = det S~'/2vo1(R4/Image ps), 
from which it follows that 


det $ = vol(R4/Ls)°vol(Hp/H)?. 


This completes the proof. 


In Theorem 4.4, we observed that a square-free positive integer D 
is associated with each two-dimensional tight frame S. We close this 
section with establishing a relationship between the integer D and the 
height of H(S)q. 
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Proposition 5.5. Let [1,...,zn] € Qn be a Q(V/—D)-rational point 
corresponding to a two-dimensional tight frame S = {ve} 9 whose 
vanishing group is H. Then D is the square-free part of h(Hg)? (= vol 
(Hp/H)*). 


Proof. One may assume z; € Q(/—D). Let wi, we € C be a Z-basis 
of the period lattice Ls (we are working in C instead of R?). Writing 
w= ai + biV—D € QW/—D), we find 


N 
vol(C/Ls)* Yi livall? = vol(C/Ls) "Wal? +--+ + lawl?) 
i=1 
1 


= (lal? +--+ + lenl?), 
|ay bg — agh\|/D 


which is equal to 2vol(Hp/H) in view of Proposition 5.4. Because 
||? +--- + |zy[? is rational, the claim is proved. 


6. Tight Frames Associated with Finite Graphs 
and Combinatorial Abel’s Theorem 


The notions of Jacobian and the Picard group together with Abel’s 
theorem play a significant role in classical algebraic geometry. The aim 
of this section is to introduce combinatorial analogues of these notions 
by using certain tight frames associated with finite graphs. 

We first fix some basic notations and terminology. A graph is 
represented by an ordered pair X = (V, FE) of the set of vertices V and 
the set of all directed edges E (note that each edge has just two directions, 
which are to be expressed by arrows). For an directed edge e, we denote 
by o(e) the origin, and by t(e) the terminus. The inversed edge of e is 
denoted by €. With these notations, we have o(e€) = t(e), t(€) = o(e). 
An orientation of X is a subset E° of E such that FE = E° U E° and 
E° E° = %. We use the notation E, for the set of directed edges e 
with o(e) = x. Throughout, the degree deg x = |Ez| is assumed to be 
greater than or equal to three for every vertex x. 

We let Xo = (Vo, Eo) be a finite connected graph that is regarded 
as a one-dimensional cell complex. From now on, we shall make use 
of (co)homology theory of cell complexes. Let K be Z,Q, or R, and let 
0: Cy(X0, K) — Co(Xo, K) be the boundary operators of chain groups; 
namely the homomorphism defined by d(e) = t(e) — o(e) (e € Eo). The 
1-homology group Hj(Xo, Z) = ker 0 is a direct summand of the 1-chain 
group C,(Xo, Z), and is a lattice of Hj (Xo, .R). We denote by by = 61 (Xo) 
the betti number, i.e., b) = dim Hy (Xo, R). 

Define the natural inner products on Co(Xo,R) and Ci(Xo0,R) by 
setting 
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ae 1 (t#=y) 
0 («@#y), 
and 
1 (e’ =e) 
(e, e’) = =A (e’ = e) 


0 (otherwise). 


The set of vertices Vo constitutes an orthonormal basis of Co(Xo,R), 
while an orientation Ey of Xo yields an orthonormal basis of C1(Xo, R). 
With the inner product on Hy (Xo, R) induced from the one on C) (Xo, R), 
the lattice H, (Xo, Z) is integral, and hence the dual lattice Hy (Xo, Z)# 
contains Hy (Xo, Z). 

Let a* : Co(Xp9,R) — Ci(Xo,R) be the adjoint operator of 0 with 
respect to the above inner products. It is straightforward to see 


a*c=- ) > e (ce Vo), 


ec Hor 


so 8*(Co(Xo,Z)) € Ci(Xo,Z). We also have Hy(Xo,R) = (Image a*)~, 
and hence C1 (Xo, R) = Hi (Xo, R) @ Image d* (orthogonal direct sum). 
We denote by Po : C1(X0,R) — Hi(Xo,R) the orthogonal projec- 
tion, and put vo(e) = Po(e) (e € Ep). Since ker Pp = Image 0*, we 
get 
> vole) =0 (re Vo). (16.10) 
ec Hox 
Lemma 6.1. S= {vo(e)} emo is a crystallographic 1-tight frame (and 
0 
hence {vo(e)} com is 2-tight). Its vanishing group is d*(Co(Xo, Z)), and 
its period lattice is Hy (Xo, Z)*. 
Proof. ‘The first claim is obvious because the orientation Ey gives an 
orthonormal basis of C,(Xo,R), and vo(e)’s are the projected images of 


this orthonormal basis by the orthogonal projection Po. 
For the second claim, it suffices to show that 


(Image 0*) N C1(Xo, Z) = d*(Co(Xo, Z)). 


To this end, take any 0-chain a = )°,. Vo Gt € Co(Xo,R) such that 
a*a € Ci(Xo, Z). 


a=—- >> Yo aze=— D> ageye= >> (ate) — Axe) es 


te Vo e€ For ee Eo eckEg 


SO t(e) — Goce) € Z for every e € Eo. This implies that there exists a real 
number a with ag + a€Z (a € Vo). Putting 
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B= >> (az + a) € Co(Xo,Z), 


reVo 


we obtain d*a = 0*B. This proves the claim. 
To prove that Po(Ci(Xo, Z)) = H,(Xo,Z)*, take a spanning tree T’ 
of Xo, and let 


el, €2,- meg €b, > El; €2, sess » Cb, 
be all directed edges not in T’. The vectors vo(e1),..., vo(ep,) constitute 
a Z-basis of the lattice group H,(Xo,Z)*. This is so because we may 
create a Z-basis of H1(Xo, Z) consisting of circuits c1,..., cp, in Xo such 


that c; contains e;, and 
(ci, vo(e;)) = (ci, Pole;)) = (Po(ci), ej) = (ci, ey) = by, 


namely {vo(e1), sae ,vo(eD, )} is the dual basis of {c1,..., cp, }. From this, 
our claim immediately follows. 


Let us exhibit two instructive examples. 

(1) Let Ag41 be the graph depicted in Figure 16.12. 

Then {e1,..., €q41} is an orthonormal basis of Ci(Ag+1,R), and e1 — 
€2, €2 — €3,.--,€d — €d+1 is a Z-basis of Hy (Ag+, Z). One can check 


1 
voles) = & — Fo > eg: 
j=l 


Thus {vo(e:)} ot is the tight frame associated with the equilateral 


simplex (see the proof of Proposition 2.5). 

It should be pointed out that vo(e;) — vo(ej) = ej — ej (@ # J) are 
vectors representing ridges (edges) of the simplex, and that ® = {e; — 
ej| 7 # J} is the irreducible root system of type Aq with simple roots 
€1 — €2, €2 — €3,.--,€q — €d4+1- 


€1 Cd+1 


y 


Figure 16.12 Graph Ag44 
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Figure 16.13 Graph K4 


This example has something to do with the diamond crystal (in the 
case d = 3). 

(2) Related to the diamond twin mentioned in Introduction is the 
complete graph K4 with 4 vertices. See Figure 16.13 for an orientation 
(the labeling of edges). 


Take four closed paths c. = (e2,f1,@), ce = (6,f2,4), c = 
(e1, f3, €2), c4 = (fi, fo, fg) in K4. Then cy, c2, c3, c4 as 1-chains satisfy 
cr + co + 03 + c4 =O, ler ll? = Ileal? = llegll? = Wleall? and (cj, cj) = —1 


(i #7) (c1, c2,c3 constitute a Z-basis of Hy (K4,Z)). Moreover, cj, C2, 
C3, C4 are vectors represented by directed segments joining the origin 
and vertices of the regular tetrahedron. An easy computation gives 


1 1 1 
vo(e1) = 163 — 2), vole) = ria — ¢3),  vo(e3) = ra — ¢1), 


vo(fi) = ver —¢4), vol) = ile — ca), vo(f3) = aC) — c4). 
This implies that { + vo(e;), +vo(fj)} is the root system A3. 

Now let us proceed to a combinatorial analogue of Abel’s theorem. 
The classical Abel’s theorem in algebraic geometry gives a link between 
regular (holomorphic) maps from a complex projective algebraic curve 
into two kinds of complex tori (called the Jacobian and Picard group, 
respectively). In the graph-theoretic setting, certain finite abelian groups 
are to act as complex tori, and the counterparts of the regular map into 
the Jacobian is defined by using the tight frame given in Lemma 6.1. 
The approach explained from now is somewhat different from the one in 
[37], [38] (see also [31], [3] for different ways to introduce the concepts). 

A key role is played by the direct sum Hj(Xo,Z) + 0*(Co(Xo,Z)). 
Because this is a sublattice of C1(Xo0,Z), the factor group 


C1 (Xo, Z)/[Hi (Xo, Z) + 8*(Co(Xo, Z))| 


is a finite abelian group. We shall rewrite this group in two ways; one 
is J(Xo), an analogue of Jacobian; another is Pic(Xo0), an analogue of 
Picard group. 
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In view of the above lemma, the projection Pop induces an isomorphism 
C1 (Xo, Z)/3*(Co(Xo,Z)) —> Hi (Xo, Z)*. 
Notice that the inclusion 
A (X0,Z) — C1(Xo, Z) 


induces an injection H,(X9,Z) — C1(Xo, Z)/0*(Co(Xo, Z)), so that 
A (Xo, Z) is identified with a subgroup of C1 (Xo, Z)/0*(Co(Xo, Z)). The 
double factor group 


[C1 (Xo, Z)/d*(Co(Xo, Z))|/ Hi (Xo, Z) 


is isomorphic to Ci (Xo, Z)/[H1(Xo,Z) + 9*(Co(Xo, Z))|. Thus, we have 
an isomorphism 


C1 (Xo, Z)/[Hi(Xo, Z) + 8*(Co(Xo, Z))| —> Hi(Xo, Z)* /Hi(Xo, Z), 


where we should note that 7(Xo) = Hy (Xo, Z)* / Hy (Xo, Z) is a finite 
subgroup of the torus group J(X0) = Ai (Xo, R)/Ai (Xo, Z). One may 
think of (Xo) as a combinatorial analogue of Jacobian. 

On the other hand, the boundary operator induces an isomorphism 


Ci (Xo, Z)/H1 (Xo, Z) — kere, 


where € : Co(Xo, Z) —> Z is the homomorphism (called argumentation) 
defined by 


€ yo Gt |S) ae. 


xe Vo ze Vo 
Thus in a similar way as above, we obtain an isomorphism 
C1 (Xo, Z)/|Hi (Xo, Z) + 3*(Co(Xo, Z))| — ker €/83*(Co(Xo,Z)), 


where ker € is regarded as an analogue of the group of divisors with 
degree 0, so that we denote it by Div? (Xo) after the notation in algebraic 
geometry. Meanwhile 0* (Co(Xo, Z)) is regarded as an analogue of the 
group of principal divisors (see [3] for the reason). Therefore, we denote it 
by Prin(Xo). Remembering again the terminology in algebraic geometry, 
it is justified to write Pic(Xo) for the factor group Div? (Xo) /Prin(Xo) 
and to call it the Picard group of Xo. 
Summarizing the argument, we have 


Proposition 6.2. There is a natural isomorphism gy of the Picard 
group Pic(X9) onto the Jacobian J (Xo). 


To describe g more explicitly, pick up a reference vertex zo € Vo. For 
xz € Vo, select a path c = (e€1,..., €n) in Xg such that o(c)(= o(e1)) = x 
and t(c)( = t(€n)) = x. Then regarding c as a 1-chain, we get 0c = 4-10. 
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From the way to construct g, we easily find that g brings x — a € kere 
modulo 94*(Co(Xo, Z)) to vo(e1) + +--+ volen) € Hi(Xo,Z)* modulo 
A, (Xo, Z). 

Now define the combinatorial Albanese map ®*! : V —> J(Xo) by 
setting 


@"(x) = vo(e1) +--+: + volen) € Hi(Xo,Z)* modulo Hy (Xo, Z), 


where the sum on the right-hand side is an analogue of the line integral 
of a holomorphic 1-form, which appears in the definition of classical 
Albanese maps (we shall see in the next section that vg deserves to be 
called “harmonic” as a cochain of Xo). We also define the combinatorial 
Abel-Jacobi map ®*) : Vo —> Pic(Xo) by 


@ (x) = x — 19 € Div?(Xo) modulo Prin(Xo). 


By definition, we obtain @*! = yo 64), This is nothing but an analogue 
of classical Abel’s theorem. 

To describe the structure of 7(Xo0), we consider the integral matrix 
A= ((a;, a;)) € My, (Z) where {a1,...,0,} is a Z-basis of Hi (Xo, Z). 
Applying the theory of elementary divisors to A, we find P, Q € GL», (Z) 


and positive integers k1,..., kp, such that 
ko 6 OFF CUO00leee 0 
PAg=|° me os oe (16.11) 
0 O O ++: kp, 


where kj divides ki41 (¢ = 1,..., 61 — 1). The array (k1,...,ky,), which 
depends only on Xo, determines the structure of 7 (Xo); that is, 7(Xo) = 
Zi, X+++X Zk, + For instance, using this fact, we find 7(Ag+1) = Za+1, 
and J (Kn) = Zn)". 

We have more about 7(Xo). Algebraic graph theory [4], [5] allows us 
to establish the fact that the order of 7(Xo) (and Pic(Xo)) is equal to 
K(Xo), the tree number for Xo, which is defined to be the number of 
spanning trees in Xo; therefore kj --- ky, = «(Xo) (see [38]). Further the 
canonical inner product on H (Xo, R) induces a flat metric on the torus 
J(X9) = Hi (Xo, R)/ Hi (Xo, Z) for which, in view of (16.7), we have 


vol(J(Xo)) = «(Xo)"/?. 


The Jacobian 7(Xo) has another appendage. A nondegenerate sym- 
metric bilinear form on J7(Xo) with values in Q/Z is induced from the 
inner product on Hj(Xo,R). One may think of this bilinear form as an 
analogue of “principal polarization” 

Before closing this section, we make a minor remark on direct 
summands of ZN , which is related to the above discussion. Let H be 
a direct summand of Z of rank k. We put 
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Ht2 = {xe Z| (x, H) = 0}, 


which is clearly a direct summand of Z of rank N—k. We easily observe 
that (H42)"" = H. Although (H12), = Hd and Hp ® Hd = RN, the 
direct sum H @ H+ does not agree with Z in general. Indeed ZN /(H® 
val +2) is isomorphic to H# /H (this is actually a generalization of what 
we have seen above). To prove this, let P : RN —,. RN be the orthogonal 
projection whose image is Hp. It suffices to check that P (ZN) =H #. for 
if this is true, then P|ZN induces an isomorphism of ZN /Htz onto H# 
because Ker P|ZN = H+2. Take a Z-basis {a,...,ay} of Z% such that 
{aj,...,a,} is a Z-basis of H. Since A = (aj,...,ay) € GLy(Z), there 
exists B = (b,..., by) € GLy(Z) such that ‘BA = Iy, or equivalently 
(b;,aj;) = 46;;. Therefore, 


(P(b;),a;) = (bj, P(a;)) = (bj,aj) = 35 <j <hk), 
which implies {P(b1), exacal P(b,,)} is a Z-basis of H*#. Therefore, 


P(ZN) = H#, as required. 
Using |Z" /(H ® H12)| = |H7*/H|, we find 


vol(H/H)vol( Ht /H+2) = vol(R% /(H ® H+2)) 
|Z /(H @ H+2)|vol(RN /Z%) 


=|H*/H\, 


sO 
vol(H /H+2) = vol(He/H) = |H*/H|"/?. 


7. Standard Crystal Models and Tight Frames 


The combinatorial Albanese map pl: Vo —> J(Xp) extends toa piece- 
wise linear map of Xo into the flat torus J(Xo) = Ai (Xo, R)/Ai (Xo, Z): 


OF Nees Xa 


which, if we think of Xo as a (singular) Riemannian manifold, turns out 
to be harmonic in the sense of Eells and Sampson [18] (see [25] for the 
detail). 

Let AGP be the maximal abelian covering graph over Xo, i.e., the 
abelian covering graph over Xg whose covering transformation group is 
H,(Xo,Z). Consider a lifting Sy : X8> —> H,(Xo,R) of &7!, which 
obviously satisfies 


Dolor) = Po(z) +a (o € Hy (Xo, Z)). 
The image Do(X2) is considered a b(Xo)-dimensional crystal net. For 


instance, if Xo = Aq (resp. Xo = K4), then Do(X2) is the diamond 
crystal (resp. the diamond twin); see the previous section. 
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Having this observation in mind, we consider general abelian covering 
graphs over Xo and their realizations. A d-dimensional topological graph 
is an infinite-fold abelian covering graph X = (V, FE) over a finite graph 
Xo whose covering transformation group is a free abelian group L of 
rank d. Theory of covering spaces tells us that there is a subgroup H 
(called a vanishing subgroup) such that H(Xo,Z)/H = L (note that H 
is a direct summand of Hi(Xo, Z)). Actually the topological crystal X is 
the quotient graph of ee over Xg modulo H. In this view, we call Xe 
the mazimal topological crystal over Xo. 

A (periodic) realization is a piecewise linear map ® : X —> R4 
satisfying 

Poor)=P(z)+p) Wel), 


where p : L —> R® is an injective homomorphism whose image is a 
lattice in R¢.!7 We call p (resp. p(L)) the period homomorphism (resp. 
the period lattice) for ®. 

By putting v(e) = &(t(e)) _ D(o(e)) (e € E), we obtain an [- 
invariant function v on E which we may identify with a 1-cochain v € 
C!(Xo,R®) with values in R¢. Because v determines completely ® (up 
to parallel translations), we shall call v the building cochain!® of ®. One 
can check that if we identify the cohomology class [v] € H!(Xo, R?) with 
a homomorphism of H1(Xo,Z) into R4 (the duality of cohomology and 
homology), then [v] = po“, where yw : Hy (Xo, Z) —> L is the canonical 
homomorphism. In particular, Ker[v] = H and Image [v] = p(Z). 


Lemma 7.1. [37] Giving a periodic realization of a topological crystal 
over Xo is equivalent to giving a 1-cochain v € COG; R*) such that the 
image of the homomorphism [v| : Hi(Xo,Z) —> R¢ is a lattice in R®. 


We are now at the stage to give the definition of standard realizations. 
Let Hp be the subspace of H,(Xo,R) spanned by the vanishing group 
H, and He the orthogonal complement of Hp in Hi(Xo,R): 


Hi (Xo,R) = Hr © Hy. 


Then dim Hye = rank L = d. By choosing an orthonormal basis of 
ies we identify ie with the Euclidean space R¢. Using the orthogonal 
projection P : Hi(Xo,R) —> Ht, we put v(e) = P(vo(e)). Then one 
can check that v is the building cochain of a realization @ : X —> R¢. 
We call ® the normalized standard realization of X. If we say simply 


17 The network ®(X) could be “degenerate” in the sense that different vertices of 
X are realized as one points, or different edges overlap in R?. But we shall not 
exclude these possibilities. 

18 In [37], [38], the term “building block” is used. The idea to describe crystal 
structures by using finite graphs together with vector labeling is due to [9] 
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Figure 16.14 Standard realizations 


“standard realization”, it means a realization obtained by performing a 
similar transformation to the normalized one. 


In view of the properties of vo established in the previous section, we 
find 


(1) (Harmonicity) 


Y> v(e)=0 (xe Vo), (16.12) 
ecEoxr 
(2) (Tight-frame condition) 
Y- (x, v(e))v(e) = 2x (x € R%). (16.13) 
ecko 


Furthermore {v(e)} Eo is crystallographic; namely it generates a 


lattice. Since this lattice contains the image of [v], the period lattice 
is essentially rational. 


Remark 7.2. The direct sum H © 4*(Co(Xo0,Z))(_ C C1(Xo,Z)) is 
contained in the vanishing subgroup for the tight frame {v(e)}¢e Fe; 
however they need not coincide with each other. 


Figure 16.14 exhibits a few more examples of standard realizations 
(the picure on the right side is a tiling of pentagons with picturesque 
properties that has become known as the Cairo pentagon). 

Remarkable features of standard realizations are embodied in 


Proposition 7.3. (1) The standard realization of X is the unique 
minimizer, up to similar transformations, of the energy!® 


E(®) = vol(R4/p(L)) 7!" Y* Iwo? 


ecko 


19 The energy defined here is considered the potential energy per unit cell when 
we think of the crystal net as a system of harmonic oscillators. Clearly this is 
similarity invariant. 
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2) Let 6 : X —+ R® be the standard realization. Then there exists 
a homomorphism «x of the automorphism group Aut(X) of X into the 
congruence group M(d) of R@ such that 

a) when we write k(g) = (A(y), b(9)) € O(d) x R4, we have 


P(gr) = A(g)P(z) + b(g) ss (EV), 
b) the image x (Aut(X)) is a crystallographic group, a discrete co- 


compact subgroup of the motion group of R4 (see[8]). 


The assertion (1) is a direct consequence of Proposition 5.4. The 
second one tells us that the standard realization has maximal symmetry. 
See [38] for the proof relying on an asymptotic property of random walks 
on topological crystals (see also [27],[25],[28]). 

Equation (16.12) says that the cochain v is “harmonic” in the sense 
that dv = 0 where 6: C!(Xp,R%) — C!(Xo,R®%) is the adjoint of the 
coboundary operator d : C°(Xo,R%) — C!(Xo,R%) with respect to 
the natural inner products in C’(Xo9,R%). A discrete analogue of the 
Hodge—Kodaira theorem, which is almost trivial, assures us that the 
correspondence v € Ker 6 +> [v] « H 1(Xo, R®) is a linear isomorphism 
(hence dim Ker 6 = db (Xo)). Thus given p, there is a unique harmonic 
cochain v with [v] = pow. A realization satisfying (16.12) is said to be 
a harmonic realization [25] (or an equilibrium placement [14]), which is 
characterized as a minimizer of € when p is fixed.?° 

Let v € C!(Xo,R®) be a general building cochain satisfying [v] = pow. 
The distortion of the realization given by v from the harmonic one is 
measured by the 0-cochain f € C°(Xo, R®) defined by 


f(r)= Y° v(e)( = —(v)(2)), (16.14) 


ec Hor 


which is considered the resultant force acting on the “atom” x when we 
regard the crystal net as a system of harmonic oscillators. Obviously 


ys f(x) = 0. (16.15) 


rE Vo 


Conversely if f € C°(Xo, R%) satisfies (16.15), then there exists a unique 
building cochain v satisfying (16.14) and [v] = po w. Indeed this is a 
consequence of Image 6é = (ker d)+ and ker 5 = (Image d)+. 

Eigenvalues of the frame operator S : R¢ —> R®@ associated with a 
building cochain v gives an information about how much the harmonic 
realization ® is distorted from the standard one. More precisely, if A™!™ 
(resp. A™°*) is the minimal (resp. maximal) eigenvalue of S', then the 


20 Looking at things through discrete geometric analysis [36], one can see lots of 
conceptual resemblance between crystallography and electric circuits. 
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ratio R(®) = A™=*/,™in(> 1) is considered representing the degree of 
distorsion. Indeed R(®) = 1 if and only if @ is standard. 

We associate the flat torus Hp/H with a vanishing subgroup H Cc 
Ay (Xo,Z). As before, we easily observe that vol( Hp / H)? is an integer, 
and can prove, by modifying slightly the argument used in the proof of 
Proposition 5.1, that for any c > 0, there are only finitely many H of 
rank b;—d such that vol(Hp/H) < c. For this, we just work in Hj (Xo, Z) 
instead of Z. 

Let Ly (C R¢) be the period lattice for the normalized standard 
realization of the topological crystal corresponding to H, and put 
J(Xo,H) = R¢/Ly (thus J (Xo, {0}) = J(X0)). Imitating the proof of 
Proposition 5.3, one may prove 


vol(J(Xo, H)) = vol(J(Xo)) /vol(Hg/H) = k(Xo)'/*/vol(Ha/H). 


As an application of this fact, we take up the issue of “reality” of 
the standard realization; namely we ask how much part of the family 
of crystal models is occupied by standard ones which look like genuine 
crystals. A rough answer is that if we fix the base graph Xo, “most” 
standard realizations do not look realistic. 

To be more precise, we start with the inequality 


Y= liv(epil7 xl? = 5 (wee), x)? = 2lixll?, 
ecko ecko 


from which we get 


max ||v(e)|| > (2/| Bol)”. (16.16) 
ecko 


This implies that the maximal length of v(e) is bounded from below by 
a positive constant depending only on the base graph Xo. On the other 
hand, there exists a positive constant cg such that for any lattice group 
Lin R?¢ 
min ||x|| < cqvol(R2/L)1/4 
a | < cqvol(R"/L) 

(this is the celebrated “convex body theorem” due to Minkowski; cf. 
[30]). Applying this fact to the lattice group £y, we find a nonzero 


x= p(o) € Ly such that ||p(o)|| < cavol(J(Xo, fi)" and hence 


|P(ox) — &(2)\| < cavol(J(Xo, H))'/". 


(16.17) 
These facts enable us to establish the following theorem. 


Theorem 7.4. Let c be a positive constant with c < 1. For a fixed Xo, 
there are only finitely many d-dimensional topological crystals X over 
Xo whose standard realization satisfy 


(i) ©: X —> R? is injective, 
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(ii) |@(z) — Dy) || < ¢ max lv(e)|| => y is adjacent to x, or y=. 
ec Ey 


From the nature of genuine crystals, the first condition sounds natural. 
The second condition roughly means that two “atoms” close enough to 
each other must be joined by a bond.?! It is likely that the conclusion in 
the theorem is true for any reasonable definition of “reality” of crystals. 

The proof goes as follows. Suppose that there exist infinitely many 
Xn satisfying (i) and (ii), and let H, be the vanishing subgroup 
corresponding to Xy,. Then vol((Hn)e/Hn) tends to infinity, so that 
vol(J(Xo, Hn)) tends to zero as n — oo. Take an integer k with 


k > max deg = maxdeg z. For a given € > 0 with € < c(2/|Eo|)/?, 
reXo vex 


choose n such that cavol(J(Xo, H,))'/4 < €/k. By (16.17), one can find 
a nonzero o € Ly, = Hy(X0,Z)/Hn such that |Pn(o'x) — @,(2)|| < «, 
where @,, is the normalized standard realization of X,. Picking up 
a vertex « satisfying maxeeg, |lv(e)|| => (2/|Eol)!/? (see (16.16)), and 
putting 2; = o'x, we obtain k distinct vertices 21,..., 2% such that 
| P (xi) — B(x)|| < € < c max ||v(e)]l. 
eck, 

Therefore by the condition (ii), 2;’s are adjacent to x. This implies thet 
deg « > k, thereby a contradiction. 

The set of similarity classes of standard realizations of all d- 
dimensional topological crystals over Xo is identified with the rational 
Grassmannian Grp,—a(Ai (Xo, Q) (b1 = bj (Xo)). As expected from the 
discussion in Sect. 4, there is a special feature of the parameterization 
of two-dimensional standard realizations. To explain this, we introduce 
the complex vector space 


H= ize C(X%,0©) a z(e) = 0 (x € Vo) 


e€ Lor 


This is nothing but the space of harmonic cochains (we are identifying 
R? with C), so we find that dimcH = b,(Xo) We denote by P(H) 
the projective space associated with H, and by Q(Xo) the quadric 
defined by 


Q(Xo) = 4 [2] € PUHD| SY xe)? =0 


ec Eo 


21 This condition may not be enough (or may be too strong) to characterize the 
“reality” of a crytstal model because the physical and chemical aspects of crystals 
are ignored. In particular, the electron clouds, which are responsible for chemical 
bonding in crystals, are not involved in the simple network models. 
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Figure 16.15 The kagome lattice 


If we fix an orientation Ey = {e1,...,en}, then by the correspondence 
Lt (z(e1),...,2(en)), we may think of Q(Xo) as a (b1(Xo) — 2)- 
dimensional subvariety of the complex projective space PN-1(C). Then 
the intersection 


Q%)A LJ PX *(Qv—D)) 
D>0 
square-free 
is identified with the family of all oriented similarity classes of standard 
realizations of two-dimensional topological crystals over Xg [39]. This is a 
straightforward generalization of the observation in Sect. 4. We may also 
prove that D is the square-free part of «(Xo)vol(Hp/H)? provided that 
Hf is the vanishing subgroup for the standard realization corresponding 


to a point in Q(Xo)N PN-!(Q(V/—D)). 


Example 7.5. (1) The kagome lattice (Figure 16.15) corresponds to 
the Q(./—3)-rational points 


a 14/23 1+ /-3 caine 1 


’ pol, ’ 
2 2 2 2 


of the two-dimensional projective quadric 
5 = ie 2 
{[21, 22, 28, 24, 5, %] € P?(C); 2° +---+ 4% =0, 
A+%= Btu, DMDtmMH=mt+%, Bt+%= 2+ Bh}. 


(2) Figure 16.16 is the so-called dice lattice (also referred to as the 73 
lattice). This corresponds to Q(./—3)-rational points 


I -l+J/-3 -1FV/-3 1 1+ /-3 ad 
? 9) | 2 bf o] 2 ’ 9 


of the quadric 


{[z1,---, 2] € P?(C)| a2+---+267 =0, tates =0, watatz = 0}. 
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co 


Figure 16.16 Dice lattice 


Remark 7.6. Two examples above are realizations of two-dimensional 
topological crystals which come from periodic tilings (tessellations of tiles 
in the plane which are periodic with respect to the translational action 
by lattice groups). Easy topological considerations lead to the fact that 
there are only finitely many two-dimensional topological crystals over 
a fixed finite graph Xo whose standard realizations are 1-skeletons of 
tilings. In other words, there are only finitely many rational points in 
Q(Xo) which correspond to tilings. What we need to notice is that a 
tiling induces a cellular decomposition of the two-dimensional torus, and 
that there are only finitely many ways (in a topological sense) to attach 
2-cells to a finite graph in order to obtain a torus. 


Proposition 7.3 somehow claims that the standard realization is 
a natural concept.22 To give another justification for the adjective 
“standard,” we shall see that a crystal net ®(X) with “big” symmetry 
is a standard model. 

We assume for simplicity that ® : X —> R® is injective. Let I be 
a group of congruence transformations preserving ®(X) and containing 
the period lattice. Clearly T is a crystallographic group, so that we have 
an exact sequence: 


0—- LT K — 1, 


where L(= R4 N71) is a lattice containing the period lattice, and K C 
O(d) is what is called the point group. Note that the isotropy group I'y 
(x € R®) is identified with a subgroup of K via the (injective) restriction 
of the homomorphism T —> K to Ixy. Under the assumption on @, each 
g €T induces an automorphism of X, so I is regarded as a subgroup of 
Aut(X). 


22 The special features of standard realizations might remind the reader of the claim 
about the golden ratio (1.618033---), a root of 2? = +1, which overemphasizes 
its significance in the history of art, architecture, sculpture, and anatomy. I am 
not going to overrate the significance of standard realizations though quite a few 
structures in nature and art are explained using standard realizations. 
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Theorem 7.7. Suppose that 

(1) the action of the point group K on R®¢ is irreducible, and 

(2) for any verter x € V, the fixed point set for Ve (q)-action on R4 
is {0}, i.e., {xe R®| gx =x (g € Teq))} = {0}. 

Then ® is a standard realization. 


Proof. The group K acts on {v(e)}. eR in a natural manner. Indeed, 
writing (ox) = A(o) G(x) + b(c) (o ET), we have 


vice) = ®(t(ce)) _ ®(o(ce)) = A(o)[® (t(e)) - &(o(e))| = A(o)v(e). 


In view of Proposition 2.1, the assumption (1) assures us that {v(©)} com 
is tight. On the other hand, the vector )).<7, v(e) is (a-invariant. 
Thus, )’eex, Ve) = 0 by the assumption (2). This completes the 


proof. 


As a corollary, we have 


Theorem 7.8. The 1-skeleton of a Coxeter complex is a standard 
model. 


In this theorem, a Coxeter complex means a triangulation of R¢ such 
that 

(a) if we denote by {Ave A the set of d-dimensional simplices in this 
triangulation, then the action of the group I’ generated by all reflections 
fixing facets of A@ (a € A) preserves the triangulation, and 

(b) T’ acts transitively on {NOV ea 

For the proof, we use the fact that I is a crystallographic group,?° 
which acts on the 1-skeleton as well, and that the point group for I is the 
Wey] group associated with an irreducible root system. Clearly the fixed 
point set for ’y-action is {0} for every vertex x, thereby the condition (2) 
being satisfied. Thus one may apply Theorem 7.7 to complete the proof. 

A final remark is in order. The reader might wonder what is the 
practical use of standard crystal models. Straightforwardly speaking, 
three-dimensional standard realizations are purely mathematical out- 
growths of logical reasoning. Even if a standard realization (or its 
deformation) looks realistic, it does not necessarily exist in nature; 
namely it is merely a model of a hypothetical crystal. Once we find 
a hypothetical crystal, however, a systematic prediction of its physical 
properties for appropriate atoms can be carried out by first principles 
calculations used in chemistry. The prediction appealing to the computer 
power encourages (or discourages) material scientists to synthesize the 
hypothetical crystals. 


23 Actually T is isomorphic to a semidirect product of Z4 and the point group. 
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